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Preface 


This book is the eighth edition of an elementary text on solid state/ 
condensed matter physics for seniors and beginning graduate students of the 
physical sciences, chemistry, and engineering. In the years since the first edi- 
tion was published the field has developed vigorously, and there are notable 
applications. The challenge to the author has been to treat significant new 
areas while maintaining the introductory level of the text. It would be a pity to 
present such a physical, tactile field as an exercise in formalism. 

At the first edition in 1953 superconductivity was not understood; Fermi 
surfaces in metals were beginning to be explored and cyclotron resonance in 
semiconductors had just been observed; ferrites and permanent magnets were 
beginning to be understood; only a few physicists then believed in the reality of 
spin waves. Nanophysics was forty years off. In other fields, the structure of 
DNA was determined and the drift of continents on the Earth was demon- 
strated. It was a great time to be in Science, as it is now. I have tried with the 
successive editions of ISSP to introduce new generations to the same excitement. 

There are several changes from the seventh edition, as well as much 
clarification: 

• An important chapter has been added on nanophysics, contributed by an 
active worker in the field, Professor Paul L. McEuen of Cornell University. 
Nanophysics is the science of materials with one, two. or three small dimen- 
sions, where “small” means (nanometer 10 -9 m). This field is the most excit- 
ing and vigorous addition to solid state science in the last ten years. 

• The text makes use of the simplifications made possible by the universal 
availability of computers. Bibliographies and references have been nearly 
eliminated because simple computer searches using keywords on a search 
engine such as Google will quickly generate many useful and more recent 
references. As an example of what can be done on the Web, explore the 
entry http://www.physicsweb.org/bestof/cond-mat. No lack of honor is in- 
tended by the omissions of early or traditional references to the workers 
who first worked on the problems of the solid state. 

• The order of the chapters has been changed: superconductivity and 
magnetism appear earlier, thereby making it easier to arrange an interesting 
one-semester course. 

The crystallographic notation conforms with current usage in physics. Im- 
portant equations in the body of the text are repeated in SI and CGS-Gaussian 
units, where these differ, except where a single indicated substitution vail 
translate from CGS to SI. The dual usage in this book has been found helpful 
and acceptable. Tables arc in conventional units. The symbol e denotes the 
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charge on the proton and is positive. The notation (18) refers to Equation 18 
of the current chapter, but (3.18) refers to Equation 18 of Chapter 3. A caret (') 
over a vector denotes a unit vector. 

Few of the problems are exactly easy: Most were devised to carry forward 
the subject of the chapter. With few exceptions, the problems are those of the 
original sixth and seventh editions. The notation QTS refers to my Quantum 
Theory of Solids, with solutions by C. Y. Fong; TP refers to Thermal Physics, 
with H. Kroemer. 

This edition owes much to detailed reviews of the entire text by Professor 
Paul L. McEuen of Cornell University and Professor Roger Lewis of Wollongong 
University in Australia. They helped make the book much easier to read and un- 
derstand. However, I must assume responsibility for the close relation of the text 
to the earlier editions, Many credits for suggestions, reviews, and photographs 
are given in the prefaces to earlier editions. I have a great debt to Stuart Johnson, 
my publisher at Wiley; Suzanne Ingrao, my editor; and Barbara Bell, my per- 
sonal assistant. 

Corrections and suggestions will be gratefully received and may be ad- 
dressed to the author by email to kittel@berkeley.edu. 

The Instructors Manual is available for download at: www.wiley.com/ 
college/kittel. 


Charles Kittel 
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(c) 

Figure X Relation of the external form of crystals to the form of the elementary building blocks. 
The building blocks are identical in (a) and (b). but different crystal faces are developed, 
(c) Cleaving a crystal of rocksalt. 



CHAPTER 1: CRYSTAL STRUCTURE 


PERIODIC ARRAYS OF ATOMS 

The serious study of solid state physics began with the discovery of x-ray 
diffraction by crystals and the publication of a series of simple calculations of 
the properties of crystals and of electrons in crystals. Why crystalline solids 
rather than noncrystalline solids? The important electronic properties of solids 
are best expressed in crystals. Thus the properties of the most important semi- 
conductors depend on the crystalline structure of the host, essentially because 
electrons have short wavelength components that respond dramatically to the 
regular periodic atomic order of the specimen. Noncrystalline materials, no- 
tably glasses, are important for optical propagation because light waves have a 
longer wavelength than electrons and see an average over the order, and not 
the less regular local order itself. 

We start the book with crystals. A crystal is formed by adding atoms in a 
constant environment, usually in a solution. Possibly the first crystal you ever 
saw was a natural quartz crystal grown in a slow geological process from a sili- 
cate solution in hot water under pressure. The crystal form develops as identical 
building blocks are added continuously. Figure 1 shows an idealized picture of 
the growth process, as imagined two centuries ago. The building blocks here 
are atoms or groups of atoms. The crystal thus formed is a three-dimensional 
periodic array of identical building blocks, apart from any imperfections and 
impurities that may accidentally be included or built into the structure. 

The original experimental evidence for the periodicity of the structure 
rests on the discovery by mineralogists that the index numbers that define the 
orientations of the faces of a crystal are exact integers. This evidence was sup- 
ported by the discovery in 1912 of x-ray diffraction by crystals, when Laue de- 
veloped the theory of x-ray diffraction by a periodic array, and his coworkers 
reported the first experimental observation of x-ray diffraction by crystals. 
The importance of x-rays for this task is that they are waves and have a wave- 
length comparable with the length of a building block of the structure. Such 
analysis can also be done with neutron diffraction and with electron diffraction, 
but x-rays are usually the tool of choice. 

The diffraction work proved decisively that crystals are built of a periodic 
array of atoms or groups of atoms. With an established atomic model of a crys- 
tal, physicists could think much further, and the development of quantum the- 
ory was of great importance to the birth of solid state physics. Related studies 
have been extended to noncrystalline solids and to quantum fluids. The wider 
field is known as condensed matter physics and is one of the largest and most 
vigorous areas of physics. 


4 


Lattice Translation Vectors 

An ideal crystal is constructed by the infinite repetition of identical groups 
of atoms (Fig. 2). A group is called the basis. The set of mathematical points to 
which the basis is attached is called the lattice. The lattice in three dimensions 
may be defined by three translation vectors a t , a 2 , a 3 , such that the arrange- 
ment of atoms in the crystal looks the same when viewed from the point r as 
when viewed from every point r' translated by an integral multiple of the as: 

r' = r + u x a, + a 2 a 2 + w 3 a 3 . (1) 

Here u t , u 2 , u 3 are arbitrary integers. The set of points r' defined by (1) for all 
u u u 2 , u 3 defines the lattice. 

The lattice is said to be primitive if any two points from which the atomic 
arrangement looks the same always satisfy (1) with a suitable choice of the in- 
tegers Uj. This statement defines the primitive translation vectors a,. There 
is no cell of smaller volume than aj • a 2 X a 3 that can serve as a building block 
for the crystal structure. We often use the primitive translation vectors to de- 
fine the crystal axes, which form three adjacent edges of the primitive paral- 
lelepiped. Nonprimitive axes are often used as crystal axes when they have a 
simple relation to the symmetry of the structure. 


Figure 2 The crystal structure is formed by 
the addition of the basis (b) to every lattice 
point of the space lattice (a). By looking at 
(c), one can recognize the basis and then one 
can abstract the space lattice. It does not 
matter where the basis is put in relation to a 
lattice point. 
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Basis and the Crystal Structure 

The basis of the crystal structure can be identified once the crystal axes 
have been chosen. Figure 2 shows how a crystal is made by adding a basis to 
every lattice point — of course the lattice points are just mathematical con- 
structions. Every basis in a given crystal is identical to every other in composi- 
tion, arrangement, and orientation. 

The number of atoms in the basis may be one, or it may be more than one. 
The position of the center of an atom j of the basis relative to the associated 
lattice point is 

r i3 x A + ^fa + ya 3 . (2) 

We may arrange the origin, which we have called the associated lattice point, 
so that 0 £ Xj, ijj, Zj < 1. 










Figure 3a Lattice points of a space lattice in two dimensions. All pairs of vectors a,, a 2 arc trans- 
lation vectors of the lattice. But a/", a 2 '" are not primitive translation vectors because we cannot 
form the lattice translation T from integral combinations of a/" and a 2 "'. The other pairs shown 
of a 2 and a, may be taken as the primitive translation vectors of the lattice. The parallelograms 1, 
2, 3 are equal in area and any of them could be taken as the primitive cell. The parallelogram 4 has 
twice the area of a primitive cell. 


Figure 3b Primitive cell of a space lattice in three dimensions. 

Figure 3c Suppose these points are identical atoms: Sketch in on the figure a set of lattice points, 
a choice of primitive axes, a primitive cell, and the basis of atoms associated with a lattice point. 
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Figure 4 A primitive cell may also be chosen fol- 
lowing this procedure: (1) draw lines to connect a 
given lattice point to all nearby lattice points; (2) at 
the midpoint and normal to these lines, draw new 
lines or planes. The smallest volume enclosed in this 
way is the Wigner-Seitz primitive cell. All space may 
be fdled by these cells, just as by the cells of Fig. 3. 



Primitive Lattice Cell 

The parallelepiped defined by primitive axes a l5 a 2 , a 3 is called a primitive 
cell (Fig. 3b). A primitive cell is a type of cell or unit cell. (The adjective unit is 
superfluous and not needed.) A cell will fill all space by the repetition of suit- 
able crystal translation operations. A primitive cell is a minimum-volume cell. 
There are many ways of choosing the primitive axes and primitive cell for a 
given lattice. The number of atoms in a primitive cell or primitive basis is 
always the same for a given crystal structure. 

There is always one lattice point per primitive cell. If the primitive cell is a 
parallelepiped with lattice points at each of the eight corners, each lattice 
point is shared among eight cells, so that the total number of lattice points in 
the cell is one: 8 X f = 1. The volume of a parallelepiped with axes a b a 2 , a 3 is 

V c = (aj'ajXaj) , (3) 

by elementary vector analysis. The basis associated with a primitive cell is called 
a primitive basis. No basis contains fewer atoms than a primitive basis contains. 
Another way of choosing a primitive cell is shown in Fig. 4. This is known to 
physicists as a Wigner-Seitz cell. 


FUNDAMENTAL TYPES OF LATTICES 

Crystal lattices can be carried or mapped into themselves by the lattice 
translations T and by various other symmetry operations. A typical symmetry 
operation is that of rotation about an axis that passes through a lattice point. 
Lattices can be found such that one-, two-, three-, four-, and sixfold rotation 
axes carry the lattice into itself, corresponding to rotations by 2 tt, 2n/2, 2W3, 
27t/ 4, and 27r/6 radians and by integral multiples of these rotations. The rota- 
tion axes are denoted by the symbols 1, 2, 3, 4, and 6. 

We cannot find a lattice that goes into itself under other rotations, such as 
by 2-7T/7 radians or 27 t/ 5 radians. A single molecule properly designed can have 
any degree of rotational symmetry, but an infinite periodic lattice cannot. We 
can make a crystal from molecules that individually have a fivefold rotation axis, 
but we should not expect the lattice to have a fivefold rotation axis. In Fig. 5 we 
show what happens if we try to construct a periodic lattice having fivefold 
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Figure 5 A fivefold axis of symmetry can- 
not exist in a periodic lattice because it is 
not possible to fill the area of a plane with 
a connected array of pentagons. We can, 
however, fill all the area of a plane with just 
two distinct designs of "tiles" or elementary 
polygons. 







Figure 6 (a) A plane of symmetry parallel to the faces of a cube, (b) A diagonal plane of symmetry 
in a cube, (c) The three tetrad axes of a cube, (d) The four triad axes of a cube, (e) The six diad axes 
of a cube. 


symmetry: the pentagons do not fit together to fill all space, showing that we can- 
not combine fivefold point symmetry with the required translational periodicity. 

By lattice point group we mean the collection of symmetry operations 
which, applied about a lattice point, carry the lattice into itself. The possible ro- 
tations have been listed. We can have mirror reflections m about a plane through 
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a lattice point. The inversion operation is composed of a rotation of ir followed 
by reflection in a plane normal to the rotation axis; the total effect is to replace r 
by — r. The symmetry axes and symmetry planes of a cube are shown in Fig. 6. 

Two-Dimensional Lattice Types 

The lattice in Fig. 3a was drawn for arbitrary a 1 and a 2 . A general lattice 
such as this is known as an oblique lattice and is invariant only under rotation 
of 7 t and 27T about any lattice point. But special lattices of the oblique type can 
be invariant under rotation of 277/3, 27t/4, or 27r/6, or under mirror reflection. 
We must impose restrictive conditions on ai and a 2 if we want to construct a lat- 
tice that will be invariant under one or more of these new operations. There are 
four distinct types of restriction, and each leads to what we may call a special 
lattice type. Thus there are live distinct lattice types in two dimensions, the 
oblique lattice and the four special lattices shown in Fig. 7. Bravais lattice is 
the common phrase for a distinct lattice type; we say that there are five Bravais 
lattices in two dimensions. 



• • • 


• • • 


(a) Square lattice (b) Hexagonal lattice 

la, 1 = lun|; <p = 90° 'a, 1 ia 2 |:tf 120° 



< (c) Rectangular lattice (d) Centered rectangular lattice: 

!*,•] rf | aj . ,f = 90“ axes are shown for both the 

primitive cell and for the 
rectangular unit cell, lor 
which U,| ■*- 1 a 2 1 r <P - 90°- 

Figure 7 Four special lattices in two dimensions. 
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Three-Dimensional Lattice Types 

The point symmetry groups in three dimensions require the 14 different 
lattice types listed in Table 1. The general lattice is triclinic, and there are 
13 special lattices. These are grouped for convenience into systems classified 
according to seven types of cells, which are triclinic, monoclinic, orthorhom- 
bic, tetragonal, cubic, trigonal, and hexagonal. The division into systems is 
expressed in the table in terms of the axial relations that describe the cells. 
The cells in Fig. 8 are conventional cells: of these only the sc is a primitive cell. 
Often a nonprimitive cell has a more obvious relation with the point symmetry 
operations than has a primitive cell. 

There are three lattices in the cubic system: the simple cubic (sc) lattice, 
the body-centered cubic (bcc) lattice, and the face-centered cubic (fee) lattice. 


Table 1 The 14 lattice types in three dimensions 


System 

Number of 
lattices 

Restrictions on conventional 
cell axes and angles 

Triclinic 

1 

a l ^ ^ ^3 

a p y 

Monoclinic 

2 

d\ (I 2 #3 

a = y = 90° i= ft 

Orthorhombic 

4 

d\ ^ do Clo 

a = P = y = 90° 

Tetragonal 

2 

d\ ~~ d-2 ^ #3 

a = p = y = 90° 

Cubic 

3 

(l 1 = do ~ do 

a = /3 = y = 90° 

Trigonal 

1 

d\ — do = <23 

a = p = y < 120°, + 90° 

Hexagonal 

1 

a= 1 3 = 90° 
y = 120° 



Figure 8 The cubic space lattices. The cells shown are the conventional cells. 
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Table 2 Characteristics of cubic lattices" 



Simple 

Body-centered 

Face-centered 

Volume, conventional cell 

a 3 

a 3 

a 3 

Lattice points per cell 

1 

2 

4 

Volume, primitive cell 

a 3 

V 

■2 a 

y 

Lattice points per unit volume 

1/a 3 

2/a 3 

4 /a 3 

Number of nearest neighbors 

6 

8 

12 

Nearest-neighbor distance 

a 

3 1/2 a/2 = 0.866a 

a/2 m = 0.707a 

Number of second neighbors 

12 

6 

6 

Second neighbor distance 

2 1/2 a 

a 

a 

Packing fraction" 

i 

e' IT 

=0.524 

§17 V3 
=0.680 

swV 2 
=0.740 


"The packing fraction is the maximum proportion of the available volume that can be filled 
with hard spheres. 



Figure 9 Body-centered cubic lattice, showing a 
primitive cell. The primitive cell shown is a rhombo- 
hedron of edge jv3 a, and the angle between adja- 
cent edges is 109°28'. 



Figure 10 Primitive translation vectors of the body- 
centered cubic lattice; these vectors connect the lattice 
point at the origin to lattice points at the body centers. 
The primitive cell is obtained on completing the rhom- 
bohedron. In terms of the cube edge a, the primitive 
translation vectors are 


a,=ja(x + y — z) ; a 2 = \a{— x + y + z) ; 

a3 = 5«(*-y + z) • 

Here x, y, z are the Cartesian unit vectors. 


The characteristics of the three cubic lattices are summarized in Table 2. A 
primitive cell of the bcc lattice is shown in Fig. 9, and the primitive translation 
vectors are shown in Fig. 10. The primitive translation vectors of the fee lattice 
are shown in Fig. 11. Primitive cells by definition contain only one lattice 
point, but the conventional bcc cell contains two lattice points, and the fee cell 
contains four lattice points. 
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Figure 11 The rhombohedral primitive cell of the face-centered 
cubic crystal. The primitive translation vectors a t , a 2 , a 3 connect 
the lattice point at the origin with lattice points at the face centers. 
As drawn, the primitive vectors are: 

a,=|a(x + y); a z = f «(y + z) ; a 3 = 5 a(z + x) . 

The angles between the axes are 60°. 



Figure 12 Relation of the primitive cell 
in the hexagonal system (heavy lines) to 
a prism of hexagonal symmetry. Here 

u i ~ fl 2 ^ ° 3 - 


The position of a point in a cell is specified by (2) in terms of the atomic 
coordinates x, y , z. Here each coordinate is a fraction of the axial length a h a 2 , 
a 3 in the direction of the coordinate axis, with the origin taken at one corner of 
the cell. Thus the coordinates of the body center of a cell are |||, and the face 
centers include §|0, 0|s; |0|. In the hexagonal system the primitive cell is a 
right prism based on a rhombus with an included angle of 120°. Figure 12 
shows the relationship of the rhombic cell to a hexagonal prism. 


INDEX SYSTEM FOR CRYSTAL PLANES 

The orientation of a crystal plane is determined by three points in the 
plane, provided they are not collinear. If each point lay on a different crystal 
axis, the plane could be specified by giving the coordinates of the points in 
terms of the lattice constants a x , a 2 , a 3 . However, it turns out to be more useful 
for structure analysis to specify the orientation of a plane by the indices deter- 
mined by the following rules (Fig. 13). 

• Find the intercepts on the axes in terms of the lattice constants a x , a 2 , a 3 . 
The axes may be those of a primitive or nonprimitive cell. 
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Figure 13 This plane intercepts 
the a,, a 2 , a 3 axes at 3a b 2a 2 , 2a 3 . 
The reciprocals of these numbers 
are 5, l, |. The smallest three inte- 
gers having the same ratio are 2, 3, 
3, and thus the indices of the plane 
are (233). 




(100) (110) 





Figure 14 Indices of important planes in a cubic crystal. The plane (200) is parallel to (100) and 
to (100). 


• Take the reciprocals of these numbers and then reduce to three integers 
having the same ratio, usually the smallest three integers. The result, en- 
closed in parentheses ( hkl ), is called the index of the plane. 

For the plane whose intercepts are 4, 1, 2, the reciprocals are 1, and the 
smallest three integers having the same ratio are (142). For an intercept at infin- 
ity, the corresponding index is zero. The indices of some important planes in a 
cubic crystal are illustrated by Fig. 14. The indices (hkl) may denote a single 
plane or a set of parallel planes. If a plane cuts an axis on the negative side of the 
origin, the corresponding index is negative, indicated by placing a minus sign 
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above the index: (hkl). The cube faces of a cubic crystal are (100), (010), (001), 
(100), (010), and (001). Planes equivalent by symmetry' may be denoted by curly 
brackets (braces) around indices; the set of cube faces is (100). When we speak 
of the (200) plane we mean a plane parallel to (100) but cutting the a, axis at 
The indices [uvw] of a direction in a crystal are the set of the smallest inte- 
gers that have the ratio of the components of a vector in the desired direction, 
referred to the axes. The aj axis is the [100] direction; the — a 2 axis is the [010] 
direction. In cubic crystals the direction [hkl] is perpendicular to a plane (hkl) 
having the same indices, but this is not generally true in other crystal systems. 


SIMPLE CRYSTAL STRUCTURES 

We discuss simple crystal structures of general interest: the sodium chlo- 
ride, cesium chloride, hexagonal close-packed, diamond, and cubic zinc sulfide 
structures. 

Sodium Chloride Structure 

The sodium chloride, NaCl, structure is shown in Figs. 15 and 16. The 
lattice is face-centered cubic; the basis consists of one Na + ion and one Cl - ion 



Figure 15 We may construct the sodium chloride 
crystal structure by arranging Na + and Cl - ions alter- 
nately at the lattice points of a simple cubic lattice. In 
the crystal each ion is surrounded by six nearest neigh- 
bors of the opposite charge. The space lattice is fee, 
and the basis has one Cl“ ion at 000 and one Na + ion at 
III. The figure shows one conventional cubic cell. 
The ionic diameters here are reduced in relation to the 
cell in order to clarify the spatial arrangement. 



Figure 16 Model of sodium chloride. The sodium ions are 
smaller than the chlorine ions. (Courtesy of A. N. Holden and 
P. Singer.) 
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Figure 17 Natural crystals of lead sulfide, PbS, which has the 
NaCl crystal structure. (Photograph by B. Burleson.) 



structure. The space lattice is simple 
cubic, and the basis has one Cs + ion at 
000 and one Cl ’ ion at j 


separated by one-half the body diagonal of a unit cube. There are four units of 
NaCl in each unit cube, with atoms in the positions 

Cl: 000 ; i'O ; ±0| ; 0|| . 

Na: ill'; 6o§ ; 6§0 : §)0 . 

Each atom has as nearest neighbors six atoms of the opposite kind. Represen- 
tative crystals having the NaCl arrangement include those in the following 
table. The cube edge a is given in angstroms; 1 A = 10 -s cm = 10 -10 m = 0.1 
nm. Figure 17 is a photograph of crystals of lead sulfide (PbS) from Joplin, 
Missouri. The Joplin specimens form in beautiful cubes. 


Crystal 

a 

Crystal 

a 

L i Id 

4.08 A 

AgBr 

5.77 A 

MgO 

4.20 

PbS 

5.92 

MnO 

4.43 

KC1 

6.29 

NaCl: 

5.63 

KBr 

6.59 


Cesium Chloride Structure 

The cesium chloride structure is shown in Fig. 18. There is one molecule 
per primitive cell, with atoms at the corners 000 and body-centered positions 
1 1 1 of the simple cubic space lattice. Each atom may be viewed as at the; center 
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Figure 19 A close-packed layer of spheres is shown, with centers at points marked A. A second 
and identical layer of spheres can be placed on top of this, above and parallel to the plane of the 
drawing, with centers over the points marked B. There are two choices for a third layer. It can go 
in over A or over C. If it goes in oveT A, the sequence is ABABAB . . . and the structure is hexagonal 
close-packed. If the third layer goes in over C, the sequence is ABCABCABC . . . and the structure 
is face-centered cubic. 



Figure 20 The hexagonal close-packed structure. 
The atom positions in this structure do not constitute 
a space lattice. The space lattice is simple hexagonal 
with a basis of two identical atoms associated with 
each lattice point. The lattice parameters a and c are 
indicated, where a is in the basal plane and c is the 
magnitude of the axis a 3 of Fig. 12. 


of a cube of atoms of the opposite kind, so that the number of nearest neigh- 
bors or coordination number is eight. 


Crystal 

a 

Crvstal 

a 

BeCu 

2.70 A 

LiHg 

3.29 A 

AlNi 

2.88 

NH.CI 

3.87 

CuZn (/3-brass) 

2.94 

TlBr 

3.97 

CuPd 

2.99 

CsCl 

4.11 

AgMg 

3.28 

Til 

4.20 


Hexagonal Close-Packed Structure (hep) 

There are an infinite number of ways of arranging identical spheres in a 
regular array that maximizes the packing fraction (Fig. 19). One is the face- 
centered cubic structure; another is the hexagonal close-packed structure 
(Fig. 20). The fraction of the total volume occupied by the spheres is 0.74 for 
both structures. No structure, regular or not, has denser packing. 


Figure 21 The primitive cell has a , = a 2 , with an 
included angle of 120°. The c axis (or a 3 ) is normal 
to the plane of a, and a 2 . The ideal hep structure has 
c = 1.633 a. The two atoms of one basis are shown 
as solid circles. One atom of the basis is at the ori- 
gin; the other atom is at § 55 , which means at the 
position r = |aj + |a 2 + |a 3 . 



Spheres are arranged in a single closest-packed layer A by placing each 
sphere in contact with six others in a plane. This layer may serve as either the 
basal plane of an hep structure or the (111) plane of the fee structure. A sec- 
ond similar layer B may be added by placing each sphere of B in contact with 
three spheres of the bottom layer, as in Figs. 19-21. A third layer C may be 
added in two ways. We obtain the fee structure if the spheres of the third layer 
are added over the holes in the first layer that are not occupied by B. We 
obtain the hep structure when the spheres in the third layer are placed directly 
over the centers of the spheres in the first layer. 

The number of nearest-neighbor atoms is 12 for both hep and fee struc- 
tures. If the binding energy (or free energy) depended only on the number of 
nearest-neighbor bonds per atom, there would be no difference in energy 
between the fee and hep structures. 



’ c/a Crystal 
1.633 Zn 

1.581 Cd 

1.623 Co 

1.586 Y 



c/a 

1.861 

1.886 

622 

1.570 


Crystal c/a 

Zr 1.594 

Gd 1.592 

Lu 1,586 


Diamond Structure 

The diamond structure is the structure of the semiconductors silicon and 
germanium and is related to the structure of several important semiconductor 
binary compounds. The space lattice of diamond is face-centered cubic. The 
primitive basis of the diamond structure has two identical atoms at coordinates 
000 and associated with each point of the fee lattice, as shown in Fig. 22. 
Because the conventional unit cube of the fee lattice contains 4 lattice points, 
it follows that the conventional unit cube of the diamond structure contains 
2X4 = 8 atoms. There is no way to choose a primitive cell such that the basis 
of diamond contains only one atom. 
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Figure 22 Atomic positions in the cubic cell of the diamond Figure 23 Crystal structure of diamond, 

structure projected on a cube face; fractions denote height showing the tetrahedral bond arrangement, 

above the base in units of a cube edge. The points at 0 and | 
are on the fee lattice; those at j and •} are on a similar lattice 
displaced along the body diagonal by one-fourth of its length. 

With a fee space lattice, the basis consists of two identical 
atoms at 000 and j 7 f. 


The tetrahedral bonding characteristic of the diamond structure is shown 
in Fig. 23. Each atom has 4 nearest neighbors and 12 next nearest neighbors. 
The diamond structure is relatively empty: the maximum proportion of the 
available volume which may be filled by hard spheres is only 0.34, which is 46 
percent of the filling factor for a closest-packed structure such as fee or hep. 
The diamond structure is an example of the directional covalent bonding 
found in column IV of the periodic table of elements. Carbon, silicon, germa- 
nium, and tin can crystallize in the diamond structure, with lattice constants 
a = 3.567, 5.430, 5.658, and 6.49 A, respectively. Here a is the edge of the 
conventional cubic cell. 

Cubic Zinc Sulfide Structure 

The diamond structure may be viewed as two fee structures displaced 
from each other by one-quarter of a body diagonal. The cubic zinc sulfide 
(zinc blende) structure results when Zn atoms are placed on one fee lattice and 
S atoms on the other fee lattice, as in Fig. 24. The conventional cell is a cube. 
The coordinates of the Zn atoms are 000: 0||; |0|; |j0; the coordinates of the 
S atoms are H ;; 3 3 1; 3 3 j; 3 3 3 . The lattice is fee. There are four molecules 
ZnS per conventional cell. About each atom there are four equally distant 
atoms of the opposite kind arranged at the comers of a regular tetrahedron... 
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The diamond structure allows a center-of-inversion symmetry operation 
at the midpoint of every line between nearest-neighbor atoms. The inversion 
operation carries an atom at r into an atom at — r. The cubic ZnS struc- 
ture does not have inversion symmetry. Examples of the cubic zinc sulfide 
structure are 


SiC 

ZnS 

A1P 

GaP 


4.35 A 
5.41 
5.45 
5.45 


Crystal 
ZnSe 5.65 A 

GaAs 5.65 

AlAs 
InSb 


5.66 
6 46 


The close equality of the lattice constants of several pairs, notably (Al, Ga)P 
and (Al, Ga)As, makes possible the construction of semiconductor heterojunc- 
tions (Chapter 19). 


DIRECT IMAGING OF ATOMIC STRUCTURE 

Direct images of crystal structure have been produced by transmission 
electron microscopy. Perhaps the most beautiful images are produced by scan- 
ning tunneling microscopy; in STM (Chapter 19) one exploits the large varia- 
tions in quantum tunneling as a function of the height of a fine metal tip above 
the surface of a crystal. The image of Fig. 25 was produced in this way. An 
STM method has been developed that will assemble single atoms into an orga- 
nized layer nanometer structure on a crystal substrate. 

NONIDEAL CRYSTAL STRUCTURES 

The ideal crystal of classical crystallographers is formed by the periodic 
repetition of identical units in space. But no general proof has been given that 
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Figure 25 A scanning tunneling microscope 
image of atoms on a (111) surface of fee plat- 
inum at 4 K. The nearest-neighbor spacing is 
2.78 A. (Photo courtesy of D. M. Eigler, IBM 
Research Division.) 


the ideal crystal is the state of minimum energy of identical atoms at the tem- 
perature of absolute zero. At finite temperatures this is likely not to be true. We 
give a further example here. 

Random Slacking and Polytypism 

The fee and hep structures are made up of close-packed planes of atoms. 
The structures differ in the stacking sequence of the planes, fee having the se- 
quence ABC ABC . . . and hep having the sequence ABABAB .... Structures 
are known in which the stacking sequence of close-packed planes is random. 
This is known as random stacking and may be thought of as crystalline in two 
dimensions and noncrystalline or glasslike in the third. 

Polytypism is characterized by a stacking sequence with a long repeat 
unit along the stacking axis. The best known example is zinc sulfide, ZnS, in 
which more than 150 polytypes have been identified, with the longest period- 
icity being 360 layers. Another example is silicon carbide, SiC, which occurs 
with more than 45 stacking sequences of the close-packed layers. The polytype 
of SiC known as 393R has a primitive cell with a = 3.079 A and c = 989.6 A. 
The longest primitive cell observed for SiC has a repeat distance of 594 layers. 
A given sequence is repeated many times within a single crystal. The mecha- 
nism that induces such long-range crystallographic order is not a long-range 
force, but arises from spiral steps due to dislocations in the growth nucleus 
(Chapter 20). 

CRYSTAL STRUCTURE DATA 

In Table 3 we list the more common crystal structures and lattice structures 
of the elements. Values of the atomic concentration and the density are given in 
Table 4. Many elements occur in several crystal structures and transform from 





Table 3 Crystal structures of the elements 

The data given are at room temperature for the most common form, or at 
the stated temperature in deg K. (Inorganic Crystal Structure Database 
(ICSD) online. 
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H 4K 
0.088 


Li 78K 
0.542 
4.700 
3.023 


Na 5K 

1.013 

2.652 

3.659 


K 5K 


Ca 

1.53 

2.30 

3.95 


Zn 

7.13 

6.55 

2.66 


Ga 

5.91 

5.10 

2.44 


Ge 

5.32 

4.42 

2.45 


Cr 

7.19 

8.33 

2.50 


Se 

4.81 

3.67 

2.32 


Co 

8.9 

8.97 

2.50 


2.49 


Cu 

8.93 

8.45 

2.56 


Sc 

2.99 
4.27 
| 3.25 


4.00 


Pd 

12.00 

6.80 

2,75 


Xe 4K 

3.78 

1.64 

4.34 


Zr 

6.51 

4.29 

3.17 


Mo 

10.22 

6.42 

2.72 


Sr 

2.58 

1.78 

4.30 


Sb 

6.69 

3.31 

2.91 


Te 

6.25 

2.94 

2.86 


4.837 


Ta 

16.66 

5.55 

2.86 


lr 

22.55 

7.06 

2.71 


Pt 

21.47 

6.62 

2.77 


Au 

19.28 

5.90 

2.88 


Ba 

3.59 

1.60 
4.35 


W 

19.25 

6.30 

2.74 


Hf 

13.20 

4.52 

3.13 


Hg 227 

14.26 

4.26 
3.01 


Os 

22.58 

7.14 

2.68 


Re 

21.03 

6.80 

2.74 


Ac 

10.07 

2.66 

3.76 


A r 4K 

1.77 

2.66 

3.76 


Ti 

4.51 

5.66 

2.89 


V 

6.09 

7.22 

2.62 


Mn 

7.47 

8.18 

2.24 


As 

5.77 

4.65 

3.16 


I 

4.95 

2.36 

3.54 


Y 

4.48 

3.02 

3.55 


La 

6.17 

2.70 

3.73 


CS 5K 
1.997 
0.905 
5.235 


Ne 4 k 
1.51 
36 
3.16 


Table 4 Density and atomic concentration 

The data are given at atmospheric pressure and room temperature, or at the 
stated temperature in deg K. (Crystal modifications as for Table 3.) 


Be 

1.82 

12.1 

2.22 

Mg 

1.74 

4.30 

3.20 


Density in g cm -3 (10 3 kg rrr 3 ) 
Concentration in 10 22 cm" 3 (10 2B rrr 3 ) 
Nearest-neighbor distance, in A (10‘ 10 m) 
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Th Pa U Np 
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one to the other as the temperature or pressure is varied. Sometimes two struc- 
tures coexist at the same temperature and pressure, although one may be slightly 
more stable. 


SUMMARY 

• A lattice is an array of points related by the lattice translation operator 
T = u t a i + u 2 a 2 + M 3 a 3 , where u 1( « 2 , u 3 are integers and a,, a 2 , a 3 are the 
crystal axes. 

• To form a crystal we attach to every lattice point an identical basis composed 
of s atoms at the positions r ; = Xj&i + t^a 2 + ZjSt 3 , with j = 1, 2, .... s. Here 
i, y, z may be selected to have values between 0 and 1. 

• The axes a b a 2 , a 3 are primitive for the minimum cell volume |ap a 2 X a 3 | 
for which the crystal can be constructed from a lattice translation operator T 
and a basis at every lattice point. 

Problems 

1. Tetrahedral angles. The angles between the tetrahedral bonds of diamond are the 
same as the angles between the body diagonals of a cube, as in Fig. 10. Use elemen- 
tary vector analysis to find the value of the angle. 

2. Indices of planes. Consider the planes with indices (100) and (001); the lattice is 
fee, and the indices refer to the conventional cubic cell. What are the indices of 
these planes when referred to the primitive axes of Fig. 11? 

3. Hep structure. Show that the c/a ratio for an ideal hexagonal close-packed struc- 
ture is {§) 1/2 = 1.633. If c/a is significantly larger than this value, the crystal structure 
may be thought of as composed of planes of closely packed atoms, the planes being 
loosely stacked. 
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Figure 1 Wavelength versus parti- 
cle energy, for photons, neutrons, 
and electrons. 



Photon energy, keV 
Neutron energy, 0.01 eV 
Electron energy, 100 eV 



Figure 2 Derivation of the Bragg equation 2 cl sin 6 = nA; here d is the spacing of parallel atomic 
planes and 27m is the difference in phase between reflections from successive planes. The 
reflecting planes have nothing to do with the surface planes bounding the particular specimen. 




CHAPTER 2: WAVE DIFFRACTION AND 
THE RECIPROCAL LATTICE 


DIFFRACTION OF WAVES BY CRYSTALS 


The Bragg law 

We study crystal structure through the diffraction of photons, neutrons, 
and electrons (Fig. 1). The diffraction depends on the crystal structure and on 
the wavelength. At optical wavelengths such as 5000 A, the superposition of 
the waves scattered elastically by the individual atoms of a crystal results in or- 
dinary optical refraction. When the wavelength of the radiation is comparable 
with or smaller than the lattice constant, we may find diffracted beams in 
directions quite different from the incident direction. 

W. L. Bragg presented a simple explanation of the diffracted beams from a 
crystal. The Bragg derivation is simple but is convincing only because it repro- 
duces the correct result. Suppose that the incident waves are reflected specu- 
larly from parallel planes of atoms in the crystal, with each plane reflecting 
only a very small fraction of the radiation, like a lightly silvered mirror. In 
specular (mirrorlike) reflection the angle of incidence is equal to the angle of 
reflection. The diffracted beams are found when the reflections from parallel 
planes of atoms interfere constructively, as in Fig. 2. We treat elastic scatter- 
ing, in which the energy of the x-ray is not changed on reflection. 

Consider parallel lattice planes spaced d apart. The radiation is incident in 
the plane of the paper. The path difference for rays reflected from adjacent 
planes is 2d sin 6, where 6 is measured from the plane. Constructive interfer- 
ence of the radiation from successive planes occurs when the path difference 
is an integral number n of wavelengths A, so that 

( 1 ) 


This is the Bragg law, which can be satisfied only for wavelength A ^ 2d. 

Although the reflection from each plane is specular, for only certain values 
of 0 will the reflections from all periodic parallel planes add up in phase to give 
a strong reflected beam. If each plane were perfectly reflecting, only the first 
plane of a parallel set would see the radiation, and any wavelength would be re- 
flected. But each plane reflects 10 -3 to 10 -5 of the incident radiation, so that 
10 3 to 10 ’ planes may contribute to the formation of the Bragg-reflected beam in 
a perfect crystal. Reflection by a single plane of atoms is treated in Chapter 17 
on surface physics. 

The Bragg law is a consequence of the periodicity of the lattice. Notice 
that the law does not refer to the composition of tire basis of atoms associated 


25 


26 



Figure 3 Sketch of a monochromator which by Bragg reflection selects a narrow spectrum of 
x-ray or neutron wavelengths from a broad spectrum incident beam. The upper part of the figure 
shows the analysis (obtained by reflection from a second crystal) of the purity of a 1.16 A beam of 
neutrons from a calcium fluoride crystal monochromator. (After G. Bacon.) 



Figure 4 X-ray diff ractometer recording of powdered silicon, showing a counter recording of the 
diffracted beams. (Courtesy of W. Parrish.) 


with every lattice point. We shall see, however, that the composition of the 
basis determines the relative intensity of the various orders of diffraction 
(denoted by n above) from a given set of parallel planes. Bragg reflection from 
a single crystal is shown in Fig. 3 and from a powder in Fig. 4. 


SCATTERED WAVE AMPLITUDE 

The Bragg derivation of the diffraction condition (1) gives a neat state- 
ment of the condition for the constructive interference of waves scattered 
from the lattice points. We need a deeper analysis to determine the scattering 
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intensity from the basis of atoms, which means from the spatial distribution of 
electrons within each cell. 

Fourier Analysis 

We have seen that a crystal is invariant under any translation of the form 
T = Uia t + n 2 a 2 + u 3 a 3 , where u h u 2 , « 3 are integers and a,, a 2 , a 3 are the crystal 
axes. Any local physical property of the crystal, such as the charge concentra- 
tion, electron number density, or magnetic moment density is invariant under T. 
What is most important to us here is that the electron number density n( r) is a 
periodic function of r, with periods a 3 , a 2 , a 3 in the directions of the three crys- 
tal axes, respectively. Thus 

n(r + T) = n(r) . (2) 

Such periodicity creates an ideal situation for Fourier analysis. The most inter- 
esting properties of crystals are directly related to the Fourier components of 
the electron density. 

We consider first a function n(x ) in one dimension with period a in the 
direction x. We expand n(x) in a Fourier series of sines and cosines: 

n(x) = n u + 2 [C p cos(277 px/a) + S„ sin(2irpx/a)] , (3) 

;j>0 

where the p are positive integers and C p , S p are real constants, called the 
Fourier coefficients of the expansion. The factor 2rr/a in the arguments en- 
sures that n(x) has the period a-. 

n(x + a) = n 0 + ^[C p cos(27 rpx/a + 2n rp) + S p sm(2mjx/a + 2 tw)] 

(4) 

= n 0 + ^[C p cos(27rpx/fl) + S p sin(277px/a)] = n(x) . 

We say that 277 p/a is a point in the reciprocal lattice or Fourier space of the 
crystal. In one dimension these points lie on a line. The reciprocal lattice 
points tell us the allowed terms in the Fourier series (4) or (5). A term is al- 
lowed if it is consistent with the periodicity of the crystal, as in Fig. 5; other 



— — • — • — G 

4ir_27r -2-n 4v 

a a. a a 


Figure 5 

period a 
may app< 
n(x) = Yi 


points in the reciprocal space are not allowed in the Fourier expansion of a pe- 
riodic function. 

It is convenient to write the series (4) in the compact form 

n(x) =2) n P exp(i2irpx/a) , {5} 

p 

where the sum is over all integers p: positive, negative, and zero. The coeffi- 
cients n p now are complex numbers. To ensure that n(x) is a real function, we 
require 

n- p = n p , (6) 

for then the sum of the terms in p and — p is real. The asterisk on nl p denotes 
the complex conjugate of n_ p . 

With cp = 2irpx/a, the sum of the terms in p and — p in (5) is real if (6) is 
satisfied. The sum is 

n„(cos cp + i sin ip) + n p (cos <p — i sin cp) 

y y (7) 

= (n p + n_ p ) cos cp + i(n p - n p )sin cp , 

which in turn is equal to the real function 

2Re(n p } cos cp - 21m{n p } sin cp (8) 

if (6) is satisfied. Here Re{n p } and Im{n p } are real and denote the real 
and imaginary parts of n p . Thus the number density n(x) is a real function, as 
desired. 

The extension of the Fourier analysis to periodic functions n(r) in three 
dimensions is straightforward. We must find a set of vectors G such that 

n(r)=2 n c exp(iG • r) (9) 

G 

is invariant under all crystal translations T that leave the crystal invariant. It 
will be shown below that the set of Fourier coefficients n G determines the 
x-ray scattering amplitude. 

Inversion of Fourier Series. We now show that the Fourier coefficient n p 
in the series (5) is given by 

f a 

n p =a _1 I dx n(x) exp(— i2upx/a) . (10) 

•'0 


Substitute (5) in (10) to obtain 


n p = a 1 2 *V f dx cxp[t27r(p' - p)x/a] . 


(ID 
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If p' + p the value of the integral is 


i2ir(p' - p) 


( e '2*p'-p>-l) = 0 


because p' — p is an integer and exp[i27r(integer)] = 1. For the term p' = p the 
integrand is exp(iO) = 1, and the value of the integral is a, so that n ; , = a~ L n p a = 
n p , which is an identity, so that (10) is an identity. 

As in (10), the inversion of (9) gives 

n G = V 7 1 f dV n(r) exp(-iG • r) . (12) 

-'cell 

Here V c is the volume of a cell of the crystal. 


Reciprocal Lattice Vectors 

To proceed further with the Fourier analysis of the electron concentration we 
must find the vectors G of the Fourier sum Xn G exp(iG • r) as in (9). There is a 
powerful, somewhat abstract procedure for doing this. The procedure forms the 
theoretical basis for much of solid state physics, where Fourier analysis is the 
order of the day. 


We construct the axis vectors b,, b 2 , b 3 of the reciprocal lattice: 


br^2#^%- ; 

£ 1 ] * 


b, H 2v 


a, X a ! 

aj * a 2 X a 3 : 


S 3 “ 27T 


a 3 X a 2 




a 3 * a 2 X a 3 


(13) 


The factors 27r are not used by crystallographers but are convenient in solid state 
physics. 

If a!, a 2 , a 3 are primitive vectors of the crystal lattice, then bj, b 2 , b 3 are 
primitive vectors of the reciprocal lattice. Each vector defined by (13) is 
orthogonal to two axis vectors of the crystal lattice. Thus b 1; b 2 , b 3 have the 
property 

b, • a ; = 2'ir8 ij , (14) 

where S tJ = 1 if i = j and By = 0 if i + j. 

Points in tlie reciprocal lattice are mapped by the set of vectors 

G — - Ujb 3 T ndbg T u 3 b 3 , (15) 

where Oj, u 2 , n 3 are integers. A vector G of this form is a reciprocal lattice vector. 

The vectors G in the Fourier series (9) are just the reciprocal lattice vectors (15), 
for then the Fourier series representation of the electron density has the desired in- 
variance under any crystal translation T = u |a, + u 2 a 2 + « 3 a 3 . From (9), 


n(r + T) = 2 n c, exp(iG • r) exp(iG • T) . 

G 


( 16 ) 
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But exp(iG • T) = 1, because 

exp(iG • T) = expWujbj + u 2 b 2 + u 3 b 3 ) • (u^ + u 2 a 2 + u 3 a 3 )] ^ ^ 

= exp[i2'n - (t; 1 u 1 + v 2 u 2 + u 3 ti 3 )] . 

The argument of the exponential has the form 2 m times an integer, because 
+ v 2 u 2 + v 3 u 3 is an integer, being the sum of products of integers. Thus by 
(9) we have the desired invariance, n(r + T) = n( r) = ^ n c cxp(t'G • r). 

Every ciystal structure has two lattices associated with it, the crystal lattice 
and the reciprocal lattice. A diffraction pattern of a crystal is, as we shall show, 
a map of the reciprocal lattice of the crystal. A microscope image, if it could be 
resolved on a fine enough scale, is a map of the crystal structure in real space. 
The two lattices are related by the definitions (13). Thus when we rotate a crys- 
tal in a holder, we rotate both the direct lattice and the reciprocal lattice. 

Vectors in the direct lattice have the dimensions of [length]; vectors in the 
reciprocal lattice have the dimensions of [1/length]. The reciprocal lattice is a 
lattice in the Fourier space associated with the crystal. The term is motivated 
below. Wavevectors are always drawn in Fourier space, so that eveiy position 
in Fourier space may have a meaning as a description of a wave, but there is a 
special significance to the points defined by the set of Gs associated with a 
crystal structure. 

Diffraction Conditions 

Theorem. The set of reciprocal lattice vectors G determines the possible 
x-ray reflections. 

We see in Fig. 6 that the difference in phase factors is exp[i(k — k') • r] 
between beams scattered from volume elements r apart. The wavevectors of 
the incoming and outgoing beams are k and k'. We suppose that the amplitude 



Figure 6 The difference in path length of the incident wave k at the points O, r is r sin cp, and the 
difference in phase angle is (2 •nr sin <p)/A, which is equal to k • r. For the diffracted wave the dif- 
ference in phase angle is — k' ■ r. The total difference in phase angle is (k — k' ) ■ r, and the wave 
scattered from (IV at r has the phase factor exp[i(k — k') - r] relative to the wave scattered from a 
volume element at the origin O. 
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Figure 7 Definition of the scattering vector Ak such that 
k + Ak = k'. In elastic scattering the magnitudes satisfy 
h' = k. Further, in Bragg scattering from a periodic lattice, 
any allowed Ak must equal some reciprocal lattice vector G. 


of the wave scattered from a volume element is proportional to the local elec- 
tron concentration n(r). The total amplitude of the scattered wave in the di- 
rection of k' is proportional to the integral over the crystal of n(r) c/V times the 
phase factor exp[i(k — k') • r]. 

In other words, the amplitude of the electric or magnetic field vectors in 
the scattered electromagnetic wave is proportional to the following integral 
which defines the quantity F that we call the scattering amplitude: 

F = / dV n( r) exp[i(k — k') • r]=/ dV n(r) exp(— iAk • r) , (18) 

where k — k' = — Ak, or 

k + Ak = k' . (19) 

Here Ak measures the change in wavevector and is called the scattering 
vector (Fig. 7). We add Ak to k to obtain k', the wavevector of the scat- 
tered beam. 

We introduce into (18) the Fourier components (9) of n(r) to obtain for 
the scattering amplitude 

F = ^ f dV n G exp[i(G — Ak) • r] . (20) 

G 

When the scattering vector Ak is equal to a particular reciprocal lattice vector, 

m-m (2i) 

the argument of the exponential vanishes and F = Vn G . It is a simple exercise 
(Problem 4) to show that F is negligibly small when Ak differs significantly 
from any reciprocal lattice vector. 

In elastic scattering of a photon its energy hoi is conserved, so that the 
frequency u>' = ck' of the emergent beam is equal to the frequency of the inci- 
dent beam. Thus the magnitudes k and k' are equal, and k 2 = k' 2 , a result that 
holds also for elastic scattering of electron and neutron beams. From (21) we 
found Ak = G or k + G = k', so that the diffraction condition is written as 
(k + G) 2 = k 2 , or 



( 22 ) 



This is the central result of the theory of elastic scattering of waves in a 
periodic lattice. If G is a reciprocal lattice vector, so is — G, and with this sub- 
stitution we can write (22) as 

(23) 

This particular expression is often used as the condition for diffraction. 

Equation (23) is another statement of the Bragg condition (1). The result 
of Problem 1 is that the spacing d(hkl) between parallel lattice planes that are 
normal to the direction G = hb 1 + kb. 2 + (b 3 is d(hkl) = 27T/IGI. Thus the 
result 2k • G = G 2 may be written as 

2(27r/A) sin d = 2ir/d{hkl) , 

or 2 d(hkl) sin 6 = A. Here 0 is the angle between the incident beam and the 
crystal plane. 

The integers hkl that define G are not necessarily identical with the in- 
dices of an actual crystal plane, because the hkl may contain a common factor 
n, whereas in the definition of the indices in Chapter 1 the common factor has 
been eliminated. We thus obtain the Bragg result: 

2d sin 0 = nA , (24) 

where d is the spacing between adjacent parallel planes with indices h/n, 
k/n, l/n. 

Laue Equations 

The original result (21) of diffraction theory, namely that Ak = G, may be 
expressed in another way to give what are called the Laue equations. These 
are valuable because of their geometrical representation. Take the scalar prod- 
uct of both Ak and G successively with a h a 2 , a 3 . From (14) and (15) we get 

a, • Ak = 2w 1 ; a 2 • Ak = 2 irv z ; a 3 • Ak = 2 . (25) 

These equations have a simple geometrical interpretation. The first equation 
a x • Ak = 2m ] tells us that Ak lies on a certain cone about the direction of a x . 
The second equation tells us that Ak lies on a cone about a 2 as well, and the 
third equation requires that Ak lies on a cone about a 3 . Thus, at a reflection 
Ak must satisfy all three equations; it must lie at the common line of intersec- 
tion of three cones, which is a severe condition that can be satisfied only by 
systematic sweeping or searching in wavelength or crystal orientation — or by 
sheer accident. 

A beautiful construction, the Ewald construction, is exhibited in Fig. 8. 
This helps us visualize the nature of the accident that must occur in order to 
satisfy the diffraction condition in three dimensions. 
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Figure 8 The points on the right-hand side are reciprocal-lattice points of the crystal. The vector 
k is drawn in the direction of the incident x-ray beam, and the origin is chosen such that k termi- 
nates at any reciprocal lattice point. We draw a sphere of radius k = 2WA about the origin of k. 
A diffracted beam will be formed if this sphere intersects any other point in the reciprocal lattice. 
The sphere as drawn intercepts a point connected with the end of k by a reciprocal lattice vector 
G. The diffracted x-ray beam is in the direction k' = k + G. The angle 0 is the Bragg angle of 
Fig. 2. This construction is due to P. P. Ewald. 


BRILLOUIN ZONES 

Brillouin gave the statement of the diffraction condition that is most 
widely used in solid state physics, which means in the description of electron 
energy band theory and of the elementary excitations of other kinds. A 
Brillouin zone is defined as a Wigner-Seitz primitive cell in the reciprocal lat- 
tice. (The construction in the direct lattice was shown in Fig. 1.4.) The 
Brillouin zone gives a vivid geometrical interpretation of the diffraction condi- 
tion 2k • G = G 2 of Eq. (23). We divide both sides by 4 to obtain 

k • (f G) = (f G) 2 . (26) 

We now work in reciprocal space, the space of the k’s and G’s. Select a 
vector G from the origin to a reciprocal lattice point. Construct a plane normal 
to this vector G at its midpoint. This plane forms a part of a zone boundary 
(Fig. 9a). An x-ray beam in the crystal will be diffracted if its wavevector k has 
the magnitude and direction required by (26). The diffracted beam will then 
be in the direction k — G, as we see from (19) with Ak = — G. Thus the 
Brillouin construction exhibits all the wavevectors k which can be Bragg- 
reflected by the crystal. 
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Figure 9a Reciprocal lattice points near the point O at 
the origin of the reciprocal lattice. The reciprocal lattice 
vector G ( connects points OC; and G D connects OD. 
Two planes 1 and 2 are drawn which are the perpendic- 
ular bisectors of G ( and G D , respectively. Any vector 
from the origin to the plane 1, such as k, will satisfy the 
diffraction condition k, • (f G c ) = (| G c ) 2 . Any vector 
from the origin to the plane 2, such as k 2 , will satisfy the 
diffraction condition k 2 • (§ G c ) = (| G D ) 2 . 



Figure 9b Square reciprocal lattice with reciprocal 
lattice vectors shown as fine black lines. The lines 
shown in white are perpendicular bisectors of the rec- 
iprocal lattice vectors. The central square is the small- 
est volume about the origin which is bounded entirely 
by white lines. The square is the Wigner-Seitz primi- 
tive cell of the reciprocal lattice. It is called the first 
Brillouin zone. 


The set of planes that are the perpendicular bisectors of the reciprocal 
lattice vectors is of general importance in the theory of wave propagation in 
crystals: A wave whose waveveetor drawn from the origin terminates on any of 
these planes will satisfy the condition for diffraction. These planes divide the 
Fourier space of the crystal into fragments, as shown in Fig. 9b for a square 
lattice. The central square is a primitive cell of the reciprocal lattice. It is a 
Wigner-Seitz cell of the reciprocal lattice. 

The central cell in the reciprocal lattice is of special importance in the the- 
ory of solids, and we call it the first Brillouin zone. The first Brillouin zone is 
the smallest volume entirely enclosed by planes that are the perpendicular bi- 
sectors of the reciprocal lattice vectors drawn from the origin. Examples are 
shown in Figs. 10 and 11. 

Historically, Brillouin zones are not part of the language of x-ray diffrac- 
tion analysis of crystal structures, but the zones are an essential part of the 
analysis of the electronic energy-band structure of crystals. 

Reciprocal Lattice to sc Lattice 

The primitive translation vectors of a simple cubic lattice may be taken as 
the set 


a 3 = flz . 


a [ = rtX ; 


a 2 = ay ; 


(27a) 
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Figure 10 Construction of the first Brillouin 

* zone for an oblique lattice in two dimensions. We 
first draw a number of vectors from O to nearby 
points in the reciprocal lattice. Next we construct 
lines perpendicular to these vectors at their mid- 
points. The smallest enclosed area is the first Bril- 

• louin zone. 




t-i 



■ - 

' 

0 ! 

b * # k^ 

Reciprocal lattice 


Figure 1 1 Crystal and reciprocal lattices in one dimension. The basis vector in the reciprocal lat- 
tice is b, of length equal to 2 v/a. The shortest reciprocal lattice vectors from the origin are b and 
— b. The perpendicular bisectors of these vectors form the boundaries of the first Brillouin zone. 
The boundaries are at k = ± v/a . 


Here x, y, z are orthogonal vectors of unit length. The volume of the cell is 
a t ' a 2 X a 3 = a 3 . The primitive translation vectors of the reciprocal lattice are 
found from the standard prescription (13): 

b 3 = (27 T/a)x ; b 2 = ( 2v/a)y ; b 3 = ( 2v/a)i . (27b) 

Here the reciprocal lattice is itself a simple cubic lattice, now of lattice 
constant 2tr/a. 



Figure 12 Primitive basis vectors of the body-centered 


Figure 13 First Brillouin zone of the body- 
centered cubic lattice. The figure is a regular 


cubic lattice. 


rhombic dodecahedron. 


The boundaries of the first Brillouin zones are the planes normal to the six 
reciprocal lattice vectors ±b 3 , ±b 2 , ±b 3 at their midpoints: 

±|b, = ±(ir/fl)x ; ±1 b 2 = ±(n/a)y ; ±|b 3 = ±(ir/a)i . (28) 

The six planes bound a cube of edge 2n/a and of volume (2v/a) 3 ; this cube is 
the first Brillouin zone of the sc crystal lattice. 

Reciprocal Lattice to bcc Lattice 

The primitive translation vectors of the bcc lattice (Fig. 12) are 

a r = |a( — x + y + z) ; a 2 = | a(x — y + z) ; a 3 = |a(x + y — z) , (29) 

where a is the side of the conventional cube and x, y, z are orthogonal unit 
vectors parallel to the cube edges. The volume of the primitive cell is 

V= |aj • a 2 X a 3 | =|a 3 . (30) 

The primitive translations of the reciprocal lattice are defined by (13). We 
have, using (28), 

b x = {2ir/a){y + z) ; b 2 = (2 ir/a){i + z) ; b 3 = (27r/a)(x + y) . (31) 

Note by comparison with Fig. 14 (p. 37) that these are just the primitive 
vectors of an fee lattice, so that an fee lattice is the reciprocal lattice of the bcc 
lattice. 

The general reciprocal lattice vector is, for integral e b u 2 , u 3 , 

G = Cjbi + v 2 b. 2 + u 3 b 3 = (27r/a)[(t> 2 + u 3 )x + (uj + u 3 )y + (u 3 + u 2 )z] . (32) 
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Figure 14 Primitive basis vectors of the 
face-centered cubic lattice. 


The shortest G’s are the following 12 vectors, where all choices of sign are 
independent: 

(2ir/a)(±y ± z) ; (27r/a)(±x ± z) ; (2 Tr/a)(±x±y) . (33) 

One primitive cell of the reciprocal lattice is the parallelepiped described 
by the b 2 , b 2 , b 3 defined by (31). The volume of this cell in reciprocal space 
is bj • b 2 X b 3 = 2(2ir/a)'\ The cell contains one reciprocal lattice point, 
because each of the eight corner points is shared among eight parallelepipeds. 
Each parallelepiped contains one-eighth of each of eight corner points (see 
Fig. 12). 

Another primitive cell is the central (Wigner-Seitz) cell of the reciprocal 
lattice which is the first Brillouin zone. Each such cell contains one lattice 
point at the central point of the cell. This zone (for the bcc lattice) is bounded 
by the planes normal to the 12 vectors of Eq. (33) at their midpoints. The zone 
is a regular 12-faced solid, a rhombic dodecahedron, as shown in Fig. 13. 

Reciprocal Lattice to fee Lattice 

The primitive translation vectors of the fee lattice of Fig. 14 are 

a 1 = ^fl(y+z); a 2 = ^a(x + z) ; a 3 = |a(x + y). (34) 

The volume of the primitive cell is 


V = |aj • a 2 X a 3 1 = ^ a 3 


(35) 
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Figure 15 Brillouin zones of 
the face-centered cubic lattice. 
The cells are in reciprocal space, 
and the reciprocal lattice is body 
centered. 


The primitive translation vectors of the lattice reciprocal to the fee 
lattice are 

b! = (2 v/a)(—x + y + z) ; b 2 = (2ir/a)(x — y + z) ; 
b 3 = (2ir/a)(x + y — z) . 

These are primitive translation vectors of a bcc lattice, so that the bcc lattice is 
reciprocal to the fee lattice. The volume of the primitive cell of the reciprocal 
lattice is 4(27r/«) 3 . 

The shortest G’s are the eight vectors: 

(27r/a)(±x ± y ± z) . (37) 


The boundaries of the central cell in the reciprocal lattice are determined 
for the most part by the eight planes normal to these vectors at their 
midpoints. But the corners of the octahedron thus formed are cut by the 
planes that are the perpendicular bisectors of six other reciprocal lattice 
vectors: 


(2'7r/fl)(±2x) ; (2v/a)(±2y) ; ( 2ir/a)(±2i ) . (38) 

Note that (2ir/a)(2x) is a reciprocal lattice vector because it is equal to b 2 + b 3 . 
The first Brillouin zone is the smallest bounded volume about the origin, the 
truncated octahedron shown in Fig. 15. The six planes bound a cube of edge 
47 t/« and (before truncation) of volume (47 r/o) 3 . 
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FOURIER ANALYSIS OF THE BASIS 

When the diffraction condition Ak = G of Eq. (21) is satisfied, the scatter- 
ing amplitude (18) for a crystal of N cells may be written as 

F g = N f dV n(r) exp( -iG • r) = NS G . (39) 

J cell 

The quantity S G is called the structure factor and is defined as an integral 
over a single cell, with r = 0 at one corner. 

Often it is useful to write the electron concentration n(r) as the super- 
position of electron concentration functions n ; associated with each atom j 
of the cell. If r ; is the vector to the center of atom j, then the function 
rij( r — rj) defines the contribution of that atom to the electron concentration 
at r. The total electron concentration at r due to all atoms in the single cell is 
the sum 


n(r) = 2 «/( r ~ r )) (40) 

over the s atoms of the basis. The decomposition of n(r) is not unique, for we 
cannot always say how much charge density is associated with each atom. This 
is not an important difficulty. 

The structure factor defined by (39) may now be written as integrals over 
the s atoms of a cell: 

S G = 2 J" dV Uj( r - rj) exp(— iG • r) 

„ (41) 

= 2j ex p( ~ iG ' r j) f dV rij(p) exp( -f G • p) , 

J 

where p = r — Tj. We now define the atomic foim factor as 

fj = f dV n,(p) exp( -£G • p) y (42) 


integrated over all space. If rij(p) is an atomic property,^ is an atomic property. 

We combine (41)and (42) to obtain the structure factor of the basis in 
the form 


S G = 2^exp(-iG • tj) . (43) 

j 

The usual form of this result follows on writing for atom j: 

Tj = Xj&i + tjj& 2 + ZjSl 3 , 


(44) 


( 45 ) 


as in (1.2). Then, for the reflection labelled by Oj, o 2 , o 3 , we have 

G • t } = (ojbj + o 2 b 2 + o 3 b 3 ) • {xj&y + xjj * 2 + z,a 3 ) 

= 2ir(«i Xj + v 2 ijj + v&j) , 

so that (43) becomes 

S G (ui 0 2 o 3 ) = 'Zfj exp[-i27 Tiv&j + v&j + o^)] . (46) 

The structure factor S need not be real because the scattered intensity will 
involve S*S, where S* is the complex conjugate of S so that S*S is real. 

Structure Factor of the bcc Lattice 

The bcc basis referred to the cubic cell has identical atoms at = y 1 = 
Zi = 0 and at x 2 = y 2 = z 2 = j . Thus (46) becomes 

S(vjv 2 v 3 ) =/{l + exp[-i7r(o! + v 2 + o 3 )]} , (47) 

where / is the form factor of an atom. The value of S is zero whenever 
the exponential has the value —1, which is whenever the argument 
is — t7r X (odd integer). Thus we have 

S = 0 when v l + v 2 + v 3 — odd integer ; 

S = 2/ when + v 2 + o 3 = even integer . 

Metallic sodium has a bcc structure. The diffraction pattern does not con- 
tain lines such as (100), (300), (111), or (221), but lines such as (200), (110), and 
(222) will be present; here the indices (o^fs) are referred to a cubic cell. What 
is the physical interpretation of the result that the (100) reflection vanishes? 
The (100) reflection normally occurs when reflections from the planes that 
bound the cubic cell differ in phase by 2 tt. In the bcc lattice there is an inter- 
vening plane (Fig. 16) of atoms, labeled the second plane in the figure, which is 
equal in scattering power to the other planes. Situated midway between them, 
it gives a reflection retarded in phase by 7r with respect to the first plane, 
thereby canceling the contribution from that plane. The cancellation of the 
(100) reflection occurs in the bcc lattice because the planes are identical in 
composition. A similar cancellation can easily be found in the hep structure. 

Structure Factor of the fee Lattice 

The basis of the fee structure referred to the cubic cell has identical atoms 
at 000; 0s|; §0|; ||0. Thus (46) becomes 

5 ( 0 x 0205 ) = f[ 1 + exp[— iv(v 2 + o 3 )] + exp[— i7r(o! + o 3 )] 

+ expf— Moj +o 2 )]} . 


( 48 ) 
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Figure 16 Explanation of the absence of a (100) reflection from a body-centered cubic lattice. 
The phase difference between successive planes is v, so that the reflected amplitude from two 
adjacent planes is 1 + e~ llr =1 — 1 = 0. 


If all indices are even integers, S = 4/ similarly if all indices are odd integers. 
But if only one of the integers is even, two of the exponents will be odd multi- 
ples of —iir and S will vanish. If only one of the integers is odd, the same argu- 
ment applies and S will also vanish. Thus in the fee lattice no reflections can 
occur for which the indices are partly even and partly odd. 

The point is beautifully illustrated by Fig. 17: both KCl and KBr have an 
fee lattice, but n(r) for KCl simulates an sc lattice because the K + and Cl - ions 
have equal numbers of electrons. 


Atomic Form Factor 

In the expression (46) for the structure factor, there occurs the quantity/, 
which is a measure of the scattering power of thej'th atom in the unit cell. The 
value of/ involves the number and distribution of atomic electrons, and the 
wavelength and angle of scattering of the radiation. We now give a classical 
calculation of the scattering factor. 

The scattered radiation from a single atom takes account of interference 
effects within the atom. We defined the form factor in (42): 


/ = f dV rij(r) exp(— iG • r) , (49) 

with the integral extended over the electron concentration associated with a 
single atom. Let r make an angle a with G; then G • r = Gr cos a. If the elec- 
tron distribution is spherically symmetric about the origin, then 


fj = 2n / dr r 2 d{ cos a) n,(r) exp( —iGr cos a) 


= 27 t f dr rhi/r) 


e iCr _ e -icr 


iGr 
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Figure 17 Comparison of x-ray reflections from KC1 
and KBr powders. In KC1 the numbers of electrons 
of K 1 and Cl ions are equal. The scattering ampli- 
tudes /(K + ) and /(Cl - ) are almost exactly equal, so 
that the crystal looks to x-rays as if it were a 
monatomic simple cubic lattice of lattice constant 
a/2. Only even integers occur in the reflection indices 
when these are based on a cuhic lattice of lattice con- 
stant a. In KBr the form factor of Br“ is quite differ- 
ent to that of K + , and all reflections of the fee 
lattice are present. (Courtesy of R. van Nordstrand. ) 
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after integration over d { cos a) between —1 and 1. Thus the form factor is 
given by 


>2 _ sm 


f,- \ir f dr n,{r)r 


sics 

C*r 


,r<* l • I * '»"*! 


Gr 


(50) 


If the same total electron density were concentrated at r = 0, only Gr = 0 
would contribute to the integrand. In this limit (sin Gr)/Gr = 1, and 

fj = 47r f dr n/rjr 2 = Z , (51) 

the number of atomic electrons. Therefore/ is the ratio of the radiation ampli- 
tude scattered by the actual electron distribution in an atom to that scattered 
by one electron localized at a point. In the forward direction G = 0, and/ 
reduces again to the value Z. 

The overall electron distribution in a solid as seen in x-ray diffraction is 
fairly close to that of the appropriate free atoms. This statement does not 
mean that the outermost or valence electrons are not redistributed somewhat 
in forming the solid; it means only that the x-ray reflection intensities are 
represented well by the free atom values of the form factors and are not very 
sensitive to small redistributions of the electrons. 
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SUMMARY 

Various statements of the Bragg condition: 

2d sin 8 = nA ; Ak = G ; 
Laue conditions: 

• Ak = 2tw 1 ; a 2 • Ak = 2tw 2 


2k • G = G 2 


■ Ak = 2ttv-x 


The primitive translation vectors of the reciprocal lattice are 


b 3 = 27 t 


a 2 X a 3 

ai ■ a ^ X a 3 


bo = 2rr 


a 3 X a l 
a l‘ a 2 a 3 


b 3 = 27T 


aj X a2 

»j • Sj X a 3 


Here a 1; a 2 , a 3 are the primitive translation vectors of the crystal lattice. 

• A reciprocal lattice vector has the form 

G = Ujbj + t>2*>2 + «3 b 3 . 
where t> 2 > v 3 are integers or zero. 

• The scattered amplitude in the direction k' = k + Ak = k + G is propor- 
tional to the geometrical structure factor: 

S G = Xfj exp(-irj • G) = exp[-i27r(x ; e 1 + yp 2 + Zf 3 )] , 


where j runs over the s atoms of the basis, and^j is the atomic form factor 
(49) of the jth atom of the basis. The expression on the right-hand side is 
written for a reflection (u 1 t> 2 U 3 ), for which G = u 1 b 1 + v 2 h 2 + u 3 b 3 . 

• Any function invariant under a lattice translation T may be expanded in a 
Fourier series of the form 


n(r) = X n c exp(iG • r) . 

G 

• The first Brillouin zone is the Wigner-Seitz primitive cell of the reciprocal 
lattice. Only waves whose wavevector k drawn from the origin terminates on 
a surface of the Brillouin zone can be diffracted by the crystal. 


• Crystal lattice 
Simple cubic 
Body-centered cubic 
F ace-centered cubic 


First Brillouin zone 
Cube 

Rhombic dodecahedron (Fig. 13) 
Truncated octahedron (Fig. 15) 


Problems 

1. Interplanar separation. Consider a plane hkl in a crystal lattice, (a) Prove that the 
reciprocal lattice vector G = /ibj + fcb 2 + ?b t is perpendicular to this plane, (b) 
Prove that the distance between two adjacent parallel planes of the lattice is 
d(hkl ) = 2ir/jG|. (c) Show for a simple cubic lattice that d 2 = a 2 /(h 2 + k 2 + l 2 ). 


2. Hexagonal space lattice. The primitive translation vectors of the hexagonal space 
lattice may be taken as 

aj = (3 1/2 a/2)x + {a/ 2)y ; % = — (3 1/2 «/2)x + (<i/2)y ; a 3 = ci . 

(a) Show that the volume of the primitive cell is (3 W /2)«V 

(b) Show that the primitive translations of the reciprocal lattice are 

bj = (27r/3 |/2 fl)x + (2ir/a)y ; b 2 — — (2 tt/3 i/2 <j)x + (%v!a)y ; b 3 = (2tt/c)x , 

so that the lattice is its own reciprocal, but with a rotation of axes. 

(c) Describe and sketch the first Brillouin zone of the hexagonal space lattice. 

3. Volume of Brillouin zone. Show that the volume of the first Brillouin zone is 
(iTrf/V^ where V c is the volume of a crystal primitive cell. Hint: The volume of a 
Brillouin zone is equal to the volume of the primitive parallelepiped in Fourier 
space. Recall the vector identity (c X a) X (a X b) = (c - a X b)a . 

4. Width of diffraction maximum. We suppose that in a linear crystal there are 
identical point scattering centers at every lattice point p m = raa, where m is an inte- 
ger. By analogy with (20), the total scattered radiation amplitude will be proportional 
to F = 2 exp[— ima ■ Ak]. The sum over M lattice points is 

1 — exp[— iM(a ■ Ak] 

1 — exp[— i(a • Ak)] 

by the use of the series 

m=0 1-X ' 

(a) The scattered intensity is proportional to |F| 2 . Show that 

<F = sin^M(a^Ak) 
sin 2 1 (a ■ Ak) 

(b) We know that a diffraction maximum appears when a • Ak = 2irh, where h is an 
integer. We change Ak slightly and define e in a ■ Ak = 2vh + e such that e gives 
the position of the first zero in sin |M(a • Ak). Show that e — 2 tt/M, so that the width 
of the diffraction maximum is proportional to 1/M and can be extremely narrow for 
macroscopic values of M. The same result holds true for a three-dimensional crystal. 

5. Structure factor of diamond. The crystal structure of diamond is described in 
Chapter 1. The basis consists of eight atoms if the cell is taken as the conventional 
cube, (a) Find the structure factor S of this basis, (b) Find the zeros of S and show 
that the allowed reflections of the diamond structure satisfy tq + t> 2 + v 3 = 4«, 
where all indices are even and n is any integer, or else all indices are odd (Fig. 18). 
(Notice that h , k, l may be written for u 1; v 2 , v 3 and this is often done.) 

6. Form factor of atomic hydrogen. For the hydrogen atom in its ground state, the 
number density is n(r) = (irao) -1 exp{— 2r/ao), where a 0 is the Bohr radius. Show that 
the form factor is f G = 16/(4 + C 2 a%') 2 . 
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Figure 18 Neutron diffraction pattern for powdered diamond. (After G. Bacon.) 


) 

7. Diatomic line. Consider a line of atoms ABAB . . . AB, with an A — B bond length 
of \a. The form factors are/ A ,/ B for atoms A, B, respectively. The incident beam of 
x-rays is perpendicular to the line of atoms, (a) Show that the interference condition 
is nA = a cos 0, where 6 is the angle between the diffracted beam and the line of 
atoms, (b) Show that the intensity of the diffracted beam is proportional to \f A —f B p 
for n odd, and to |/ A +f B | 2 for n even, (c) Explain what happens if/ A = f B . 
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Figure 1 The principal types of crystalline binding. Ill (a) neutral atoms with closed electron 
shells are bound together weakly by tire van der Waals forces associated with fluctuations in the 
charge distributions. In (b) electrons are transferred from the alkali atoms to the halogen atoms, 
and the resulting ions are held together by attractive electrostatic forces between the positive and 
negative ions. In (c) the valence electrons are taken away from each alkali atom to form a commu- 
nal electron sea in which the positive ions arc dispersed. In (cl) the neutral atoms are hound to- 
gether by the overlapping parts of their electron distributions. 







CHAPTER 3: CRYSTAL BINDING AND ELASTIC CONSTANTS 


In this chapter we are concerned with the question: What holds a crystal 
together? The attractive electrostatic interaction between the negative charges 
of the electrons and the positive charges of the nuclei is entirely responsible 
for the cohesion of solids. Magnetic forces have only a weak effect on cohe- 
sion, and gravitational forces are negligible. Specialized terms categorize dis- 
tinctive situations: exchange energy, van der Waals forces, and covalent bonds. 
The observed differences between the forms of condensed matter are caused 
in the final analysis by differences in the distribution of the outermost elec- 
trons and the ion cores (Fig. 1). 

The cohesive energy of a crystal is defined as the energy that must be 
added to the crystal to separate its components into neutral free atoms at rest, 
at infinite separation, with the same electronic configuration. The term lattice 
energy is used in the discussion of ionic crystals and is defined as the energy 
that must be added to the crystal to separate its component ions into free ions 
at rest at infinite separation. 

Values of the cohesive energy of the crystalline elements are given in 
Table 1. Notice the wide variation in cohesive energy between different 
columns of the periodic table. The inert gas crystals are weakly bound, with 
cohesive energies less than a few percent of the cohesive energies of the ele- 
ments in the C, Si, Ge . . . column. The alkali metal crystals have intermediate 
values of the cohesive energy. The transition element metals (in the middle 
columns) are quite strongly bound. The melting temperatures (Table 2) and 
bulk modulii (Table 3) vary roughly as the cohesive energies. 

CRYSTALS OF INERT GASES 

The inert gases form the simplest crystals. The electron distribution is 
very close to that of the free atoms. Their properties at absolute zero are sum- 
marized in Table 4. The crystals are transparent insulators, weakly bound, with 
low melting temperatures. The atoms have very high ionization energies (see 
Table 5). The outermost electron shells of the atoms are completely filled, and 
the distribution of electron charge in the free atom is spherically symmetric. 
In the crystal the inert gas atoms pack together as closely as possible 1 : the 


'Zero-point motion of the atoms (kinetic energy at absolute zero) is a quantum effect that plays 
a dominant role in He 3 and He 4 . They do not solidify at zero pressure even at absolute zero temp- 
erature. The average fluctuation at 0 K of a He atom from its equilibrium position is of the order of 
30 to 40 percent of the nearest-neighbor distance. The heavier the atom, the less important the zero- 
point effects. If we omit zero-point motion, we calculate a molar volume of 9 cm 3 mol 1 for solid 
helium, as compared with the observed values of 27.5 and .36.8 cm 3 mol" 1 for liquid He 4 and liquid 
He 3 , respectively. 
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Table 1 Cohesive energies 

Energy required to form separated neutral atoms in their ground 
electronic state from the solid at 0 K at 1 atm. The data were supplied by 
Prof. Leo Brewer. 


eV/atoin 


kcal/mol 


Cr 

395. 

4.10 

Mn 

282. 

2.92 

94 5 

67.4 


As 

285.3 

2.96 

Se 

237 

2.46' 

68 2 

56.7 


W 

859. 

3.90 

Re 

775. 

8 03 

205.2 

185.2 


Ni 

428. 

4.44 

Cu 

336. 

3.49 

102.4 

80.4 





































































Li 

453.7 


Na 

371.0 


K 

336.3 


Rb 

312.6 


Cs 

301.6 


Fr 



Table 2 Melting points, in K. 

(After R. H. Lamoreaux) 


B C 

2365 





At Si 
933.5 1687 


N O F Ne 
63.15 54.36 53.48 24.56 


S Cl Ar 
388.4 172.2 83.81 



Ca Sc Ti V Cr Mn Fe Co Ni Cu Zn Ga Ge As Se Br Kr 

1113 1814 1946 2202 2133 1520 1811 1770 1728 1358 692.7 302.9 1211 1089 494 265.9 115.8 


Sr Y Zr Nb Mo Tc Ru Rh Pd Ag Cd In Sn Sb Te I Xe 

1042 1801 2128 2750 2895 2477 2527 2236 1827 1235 594.3 429.8 505.1 903.9 722.7 386.7 161.4 


Ba La Hf Ta W Re Os Ir Pt Au Hg TI Pb Bi Po At Rn 

1002 1194 2504 3293 3695 3459 3306 2720 2045 1338 234.3 577 600.7 544.6 527 


Ra Ac 
973 1324 




Ce Pr Nd Pm Sm Eu Gd Tb Dy Ho Er Tm Yb Lu 

1072 1205 1290 1346 1091 1587 1632 1684 1745 1797 1820 1098 1938 


Th Pa U Np Pu Am Cm Bk Cf Es Fm Md No Lw 

2031 1848 1406 910 913 1449 1613 1562 
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Bulk modulus in units 10 12 dyn/cm 2 or 10” N/m 2 


Compressibility in units 10 


12 cm^dyn or 1 0 


Ar tat 

0.013 

* 


Ni 

1.86 

0.538 


Ag 

1.007 

0.993 


Sr 

0.116 

8.62 


Pt 

2.783 

0.359 


Ne id] 

0.010 

IDO 


Co 

1.914 

0.522 


K 

0.032 

31. 


Rh 

2.704 

0.369 


0.068 

14.7 


0.152 

6.58 


Table 3 Isothermal bulk modulii and compressibilities at room 
temperature 

After K. Gschneidner, Jr., Solid State Physics 16, 275-426 (1964); several 
data are from F. Birch, in Handbook of physical constants, Geological Soci- 
ety of America Memoir 97, 107-173 (1966). Original references should be 
consulted when values are needed for research purposes. Values in paren- 
theses are estimates. Letters in parentheses refer to the crystal form. Let- 
ters in brackets refer to the temperature: 


[a] = 77 K; [b] = 273 K; [c] = 1 K; [d] = 4 K; [e] = 81 K. 


1.808 

0.553 


0.031 

32. 
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Table 4 Properties of inert gas crystals 

(Extrapolated to 0 K and zero pressure) 


Parameters in 



Nearest- 
neighbor 
distance, 
in A 

Experimental 

cohesive 

energy 

Melting 
point, K 

Ionization 
potential 
of free 
atom, eV 

Lennard-Jones 
potential, Eq. 10 

€, 

in 10“ 16 erg 

<?> 
in A 

kj/mol 

eV/atom 

He 

(liquid at zero pressure) 


24.58 

14 

2.56 

Ne 

3.13 

1.88 

0.02 

24.56 

21.56 

50 

2.74 

Ar 

3.76 

7.74 

0.080 

83.81 

15.76 

167 

3.40 

Kr 

4.01 

11.2 

0.116 

115.8 

14.00 

225 

3.65 

Xe 

4.35 

16.0 

0.17 

161.4 

12.13 

320 

3.98 


crystal structures (Fig. 2) are all cubic close-packed (fee), except He 3 
and He 4 . 

What holds an inert gas crystal together? The electron distribution in the 
crystal is not significantly distorted from the electron distribution around the 
free atoms because not much energy is available to distort the free atom 
charge distributions. The cohesive energy of an atom in the crystal is only 
1 percent or less of the ionization energy of an atomic electron. Part of this 
distortion gives the van der Waals interaction. 

Van der Waals-London Interaction 

Consider two identical inert gas atoms at a separation R large in compari- 
son with the radii of the atoms. What interactions exist between the two neu- 
tral atoms? If the charge distributions on the atoms were rigid, the interaction 
between atoms would be zero, because the electrostatic potential of a spheri- 
cal distribution of electronic charge is canceled outside a neutral atom by the 
electrostatic potential of the charge on the nucleus. Then the inert gas atoms 
could show no cohesion and could not condense. But the atoms induce dipole 
moments in each other, and the induced moments cause an attractive interac- 
tion between the atoms. 

As a model, we consider two identical linear harmonic oscillators 1 and 2 
separated by R. Each oscillator bears charges ±e with separations Xi and x 2 , as in 
Fig. 3. The particles oscillate along the x axis. Letpi and p 2 denote the momenta. 
The force constant is C. Then the hamiltonian of the unperturbed system is 

^ = 2m + + 2m P * + ^ ' (1) 

Each uncoupled oscillator is assumed to have the frequency <o 0 of the 
strongest optical absorption line of the atom. Thus C = moo q. 
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H 

13.595 


He 

24.58 

78.98 


Cu 

7.72 

27.93 


Sc 

6.56' ■ 
19.45 


Ti 

6,83 

20.46 


Cr 

6,76 

23.25 


Zn 

9.39 

27.35 


Ga 

6.00 

26.51 


Se 

9.75 

31.2 


Ge 

7.88 
23.81 1 


Br 

11.84 

33.4 


Zr 

6.95 

20.98 


Cd 

8.99 

25.89 


Sn 

7.34 

21.97 


V 

6.74 

21.39 


Fe 

7.90 

24.08 


Co 

.7.86 

24.9: 


Ni 

7.63 

25.78 


As 

9.81 

30.0 


Kr 

1.4.00 

38.56 


Pd 

8.33 

27.75 


Ag 

7.57 

29.05 


Xe 

12.13 

33.3 


Rb 

4.18 

31.7 


Y 

6.5 

18.9 


.7.18 

23.25 


5.78 

24.64 


Table 5 Ionization energies 

The total energy required to remove the first two electrons is the sum of the 
first and second ionization potentials. (Source: National Bureau of Standards 
Circular 467.) 
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Figure 2 Cubic close-packed (fee) crystal structure of the inert gases Ne, Ar, Kr, and Xe. The lat- 
tice parameters of the cubic cells are 4.46, 5.31, 5.64, and 6.13 A, respectively, at 4 K. 


-* 2 - 






-*i H 


Figure 3 Coordinates of the two oscillators. 


Let be the coulomb interaction energy of the two oscillators. The 
geometry is shown in the figure. The intemuclear coordinate is R. Then 


= £ + ■ 


(CGS) R'R + x,-x 2 R + x 1 R x 2 ’ 

in the approximation |x, |, |x 2 | < R we expand (2) to obtain in lowest order: 

_ 2e 2 x 1 x 2 

= 


( 2 ) 


(3) 


The total hamiltonian with the approximate form (3) for can be diago- 
nalized by the normal mode transformation 


x s — (x : + X 2 ) ; 


Xfl ^ vl {Xl ~ * 2) 


(4) 



(5) 


or, on solving for x, and x 2 , 

1 


Xl = — (X s + X a ) ; 


x 2 = — (x s ~x a ) . 


The subscripts s and a denote symmetric and antisymmetric modes of motion. 
Further, we have the momenta p s , p a associated with the two modes: 


Pl=^(Ps+Pa) l P2 = Vf - Pa) ' 


(6) 


The total hamiltonian M 0 + 3f , after the transformations (5) and (6) is 



The two frequencies of the coupled oscillators are found by inspection of (7) to be 



with oj 0 given by {C/m) 1/z . In (8) we have expanded the square root. 

The zero point energy of the system is | h(a) s + &>„); because of the interac- 
tion the sum is lowered from the uncoupled value 2 • |/L<n 0 by 

At/ = fft(A a, + AwJ = | • (9) 

This attractive interaction varies as the minus sixth power of the separation of 
the two oscillators. 

This is called the van der Waals interaction, known also as the London in- 
teraction or the induced dipole-dipole interaction. It is the principal attractive 
interaction in crystals of inert gases and also in crystals of many organic mole- 
cules. The interaction is a quantum effect, in the sense that A U —*■ 0 as h — > 0. 
Thus the zero point energy of the system is lowered by the dipole-dipole cou- 
pling of Eq. (3). The van der Waals interaction does not depend for its exis- 
tence on any overlap of the charge densities of the two atoms. 

An approximate value of the constant A in (9) for identical atoms is given 
by hco Q a ", where h(o 0 is the energy of the strongest optical absorption line and 
a is the electronic polarizability (Chapter 15). 


Repulsive Interaction 

As the two atoms are brought together, their charge distributions gradually 
overlap (Fig. 4), thereby changing the electrostatic energy of the system. At 
sufficiently close separations the overlap energy is repulsive, in large part be- 
cause of the Pauli exclusion principle. The elementary statement of the 
principle is that two electrons cannot have all their quantum numbers equal. 
When the charge distributions of two atoms overlap, there is a tendency for 
electrons from atom B to occupy in part states of atom A already occupied by 
electrons of atom A, and vice versa. 
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Figure 4 Electronic charge distribu- 
tions overlap as atoms approach. The 
solid circles denote the nuclei. 





He 

lst2st 
Total spin one 


Total electron 
energy: —59.38 eV 


Figure 5 The effect of Pauli principle on the repulsive energy: in an extreme example, two hydro- 
gen atoms are pushed together until the protons are almost in contact. The energy of the electron 
system alone can be taken from observations on atomic He, which has two electrons. In (a) the elec- 
trons have antiparallel spins and the Pauli principle has no effect: the electrons are bound by 
—78.98 eV. In (b) the spins are parallel: the Pauli principle forces the promotion of an electron from 
a Is t orbital of H to a 2s \ orbital of He. The electrons now are bound by -59.38 eV, less than (a) 
by 19.60 eV. This is the amount by which the Pauli principle has increased the repulsion. We have 
omitted the repulsive coulomb energy of the two protons, which is the same in both (a) and (b). 


The Pauli principle prevents multiple occupancy, and electron distribu- 
tions of atoms with closed shells can overlap only if accompanied by the partial 
promotion of electrons to unoccupied high energy states of the atoms. Thus 
the electron overlap increases the total energy of the system and gives a repul- 
sive contribution to the interaction. An extreme example in which the overlap 
is complete is shown in Fig. 5. 

We make no attempt here to evaluate the repulsive interaction 2 from first 
principles. Experimental data on the inert gases can be fitted well by an empirical 
repulsive potential of the form B/R 12 , where B is a positive constant, when used 


2 The overlap energy naturally depends on the radial distribution of charge about each atom, 
The mathematical calculation is always complicated even if the charge distribution is known. 
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Figure 6 Form of the Lennard- Jones potential (10) which describes the interaction of two inert gas 
atoms. The minimum occurs at fl/tr = 2 176 = 1.12. Notice how steep the curve is inside the minimum, 
and how flat it is outside the minimum. The value of U at the minimum is ^€; and U = 0 at R = a. 

together with a long-range attractive potential of the form of (9). The constants A 
and B are empirical parameters determined from independent measurements 
made in the gas phase; the data used include the virial coefficients and the viscos- 
ity. It is usual to write the total potential energy of two atoms at separation R as 

W-4c[(|)“-(|)'], (10) 

where e and <r are the new parameters, with 4ecr B = A and 4 e<r 12 = B . The 
potential (10) is known as the Lennard-Jones potential. Fig. 6. The force 
between the two atoms is given by —dU/dR. Values of e and <r given in Table 4 
can be obtained from gas-phase data, so that calculations on properties of the 
solid do not involve disposable parameters. 

Other empirical forms for the repulsive interaction are widely used, in par- 
ticular the exponential form A exp(— RJp), where p is a measure of the range of 
the interaction. This is generally as easy to handle analytically as the inverse 
power law form. 

Equilibrium Lattice Constants 

If we neglect the kinetic energy of the inert gas atoms, the cohesive en- 
ergy of an inert gas crystal is given by summing the Lennard-Jones potential 
(10) over all pairs of atoms in the crystal. If there are N atoms in the crystal, 
the total potential energy is 
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where p^R is the distance between reference atom i and any other atom j, ex- 
pressed in terms of the nearest-neighbor distance R. The factor \ occurs with 
the N to compensate for counting twice each pair of atoms. 

The summations in (11) have been evaluated, and for the fee structure 

2'p,J 12 = 12.13188 ; = 14.45392 . (12) 

j i 

There are 12 nearest-neighbor sites in the fee structure; we see that the series 
are rapidly converging and have values not far from 12. The nearest neighbors 
contribute most of the interaction energy of inert gas crystals. The corre- 
sponding sums for the hep structure are 12.13229 and 14.45489. 

If we take U tot in (11) as the total energy of the crystal, the equilibrium 
value R 0 is given by requiring that U tot be a minimum with respect to variations 
in the nearest-neighbor distance ft: 

(12)(12.13)^ — (6X14.45) , (13) 

whence 


dU* 

dR 


= 0 = -2 Ne 


Rq/ct — 1.09 , (14) 

the same for all elements with an fee structure. The observed values of RJa, 
using the independently determined values of cr given in Table 4, are: 

Ne Ar Kr Xe 
R^/a 1.14 1.11 1.10 1.09 . 

The agreement with (14) is remarkable. The slight departure of R 0 /a for the 
lighter atoms from the universal value 1.09 predicted for inert gases can be ex- 
plained by zero-point quantum effects. From measurements on the gas phase 
we have predicted the lattice constant of the crystal. 


Cohesive Energy 

The cohesive energy of inert gas crystals at absolute zero and at zero pres- 
sure is obtained by substituting (12) and (14) in (11): 


UJR) = 2 Ne 


< 1213 ) (s) , 8 -< 1445 ) (s) 6 . • 


(15) 


and, at ft = ft 0 , 

tUflo) = -(2.15)(4Ne) , (16) 

the same for all inert gases. This is the calculated cohesive energy when the 
atoms are at rest. Quantum-mechanical corrections act to reduce the binding 
by 28, 10, 6, and 4 percent of Eq. (16) for Ne, Ar, Kr, and Xe, respectively. 


The heavier the atom, the smaller the quantum correction. We can under- 
stand the origin of the quantum correction by consideration of a simple model 
in which an atom is confined by fixed boundaries. If the particle has the quan- 
tum wavelength A, where A is determined by the boundaries, then the particle 
has kinetic energy p 2 /2M = (7t/A f /2M with the de Broglie relation p = h/X for 
the connection between the momentum and the wavelength of a particle. On 
this model the quantum zero-point correction to the energy is inversely pro- 
portional to the mass. The final calculated cohesive energies agree with the ex- 
perimental values of Table 4 within 1 to 7 percent. 

One consequence of the quantum kinetic energy is that a ciystal of the iso- 
tope Ne 20 is observed to have a larger lattice constant than a crystal of Ne 22 . The 
higher quantum kinetic energy of the lighter isotope expands the lattice because 
the kinetic energy is reduced by expansion. The observed lattice constants 
(extrapolated to absolute zero from 2.5 K} are Ne 20 , 4.4644 A; Ne 22 , 4.4559 A. 

IONIC CRYSTALS 

Ionic crystals are made up of positive and negative ions. The ionic bond 
results from the electrostatic interaction of oppositely charged ions. Two com- 
mon crystal structures found for ionic crystals, the sodium chloride and the ce- 
sium chloride structures, were shown in Chapter 1. 

The electronic configurations of all ions of a simple ionic crystal corre- 
spond to closed electronic shells, as in the inert gas atoms. In lithium fluoride 
the configuration of the neutral atoms are, according to the periodic table in 
the front endpapers of this book, Li: ls 2 2s, F: ls 2 2s 2 2p 5 . The singly charged 
ions have the configurations Li + :ls 2 , F“: ls 2 2s 2 2p ( \ as for helium and neon, re- 
spectively. Inert gas atoms have closed shells, and the charge distributions are 
spherically symmetric. We expect that the charge distributions on each ion in 
an ionic crystal will have approximately spherical symmetry, with some distor- 
tion near the region of contact with neighboring atoms. This picture is con- 
firmed by x-ray studies of electron distributions (Fig. 7). 

A quick estimate suggests that we are not misguided in looking to electro- 
static interactions for a large part of the binding energy of an ionic crystal. The 
distance between a positive ion and the nearest negative ion in crystalline 
sodium chloride is 2.81 X 10 -8 cm, and the attractive coulomb part of the 
potential energy of the two ions by themselves is 5.1 eV. This value may be 
compared (Fig. 8) with the experimental value of 7.9 eV per molecular unit for 
the lattice energy of crystalline NaCl with respect to separated Na + and CL 
ions. We now calculate the energy more closely. 

Electrostatic or Madelung Energy 

The long-range interaction between ions with charge ±q is the electrostatic 
interaction ±q 2 /r, attractive between ions of opposite charge and repulsive 
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Figure 7 Electron density distribution in the 
base plane of NaCl, after x-ray studies by G. 
Schoknecht. The numbers on the contours give 
the relative electron concentration. 



+ 5.14 eV 


Ionization 

energy 




Gas 



Electron 
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Crystal 


+ 7.9 eV 
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Figure 8 The energy per molecule unit of a crys- 
tal of sodium chloride is (7.9 — 5.1 + 3.6) = 6.4 eV 
lower than the energy of separated neutral atoms. 
The lattice energy with respect to separated ions 
is 7.9 eV per molecule unit. All values on the fig- 
ure are experimental. Values of the ionization en- 
ergy are given in Table 5, and values of the elec- 
tron affinity are given in Table 6. 


between ions of the same charge. The ions arrange themselves in whatever crys- 
tal structure gives the strongest attractive interaction compatible with the repul- 
sive interaction at short distances between ion cores. The repulsive interactions 
between ions with inert gas configurations are similar to those between inert gas 
atoms. The van der Waals part of the attractive interaction in ionic crystals 
makes a relatively small contribution to the cohesive energy in ionic crystals, of 
the order of 1 or 2 percent. The main contribution to the binding energy of ionic 
crystals is electrostatic and is called the Madelung energy. 


Table 6 Electron affinities of negative ions 


The electron affinity is positive for a stable negative ion. 


Atom 

Electron affinity energy eV 

Atom 

Electron affinity energy eV 

H 

0.7542 

Si 

1.39 

Li 

0.62 

P 

0.74 

C 

1.27 

S 

2.08 

0 

1.46 

Cl 

3.61 

F 

3.40 

Br 

3.36 

Na 

0.55 

I 

3.06 

A1 

0.46 

K 

0.50 


Source: H. Hotop and W. C. Lineberger, J. Phys. Chera. Ref. Data 4, 539 (1975). 


If Ug is the interaction energy between ions i and j, we define a sum XJ t 
which includes all interactions involving the ion i: 

= u?) 

j 

where the summation includes all ions except j = i. We suppose that Uy may be 
written as the sum of a central field repulsive potential of the form A exp ( — r/p), 
where A and p are empirical parameters, and a coulomb potential ±<f'/r. Thus 

(CGS) Uy - A exp (— r tf /p) ± t/ 2 /r y , (18) 

where the + sign is taken for the like charges and the - sign for unlike charges. 
In SI units the coulomb interaction is ±q 2 /4:ire 0 r; we write this section in CGS 
units in which the coulomb interaction is ±q 2 /r. 

The repulsive term describes the fact that each ion resists overlap with the 
electron distributions of neighboring ions. We treat the strength A and range p 
as constants to be determined from observed values of the lattice constant and 
compressibility; we have used the exponential form of the empirical repulsive 
potential rather than the H -12 form used for the inert gases. The change is 
made because it may give a better representation of the repulsive interaction. 
For the ions, we do not have gas-phase data available to permit the indepen- 
dent determination of A and p. We note that p is a measure of the range of the 
repulsive interaction; when r = p, the repulsive interaction is reduced to e 1 
of the value at r = 0. 

In the NaCl structure the value of (7; does not depend on whether the 
reference ion i is a positive or a negative ion. The sum in (17) can be arranged 
to converge rapidly, so that its value will not depend on the site of the reference 
ion in the crystal, as long as it is not near the surface. We neglect surface effects 
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and write the total lattice energy U, ot of a crystal composed of N molecules or 
2 N ions as (7 tot = NU f . Here N, rather than 2 N, occurs because we must count 
each pair of interactions only once or each bond only once. The total lattice en- 
ergy is defined as the energy required to separate the crystal into individual 
ions at an infinite distance apart. 

It is convenient again to introduce quantities p tJ such that r, ; = p^R, where 
R is the nearest-neighbor separation in the crystal. If we include the repulsive 
interaction only among nearest neighbors, we have 


(CGS) 




Thus 


A exp (~R/p) - 
-Pij R 


(nearest neighbors) 
(otherwise). 


(CGS) 


U tot = NU i = N [z\e 


-R/p 


_W\ 

R ) ’ 


(19) 


( 20 ) 


where z is the number of nearest neighbors of any ion and 


a = 5V = Madelung constant . 

i PiJ 


( 21 ) 


The sum should include the nearest-neighbor contribution, which is just z. 
The (±) sign is discussed just before (25). The value of the Madelung constant 
is of central importance in the theory of an ionic crystal. Methods for its calcu- 
lation are discussed next. 

At the equilibrium separation dU^/dR = 0, so that 


(CGS) 


dU, Nz\ , D/ « , Nac i 2 
exp ( -R/p) + - sr 


= 0 


( 22 ) 


or 


(CGS) 


Rq exp (— B 0 /p) = pacf/zk . 


(23) 


This determines the equilibrium separation R 0 if the parameters p, A of the re- 
pulsive interaction are known. For SI, replace q 2 by q 2 /Arre 0 . 

The total lattice energy of the crystal of 2 N ions at their equilibrium sepa- 
ration R n may be written, using (20) and (23), as 

(CGS) U «* = " £) • (24) 


The term —Naq 2 /R {) is the Madelung energy. We shall find that p is of the 
order of 0.1R 0 , so that the repulsive interaction has a very short range. 
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Figure 9 Line of ions of alternating signs, with distance R between ions. 


Evaluation of the Madelung Constant 

The first calculation of the coulomb energy constant a was made by 
Madelung. A powerful general method for lattice sum calculations was devel- 
oped by Ewald and is developed in Appendix B. Computers are now used for 
the calculations. 

The definition of the Madelung constant a is, by (21), 

_ v ,(±) 

“ ? Pv ' 

For (20) to give a stable crystal it is necessary that a be positive. If we take the 
reference ion as a negative charge, the plus sign will apply to positive ions and 
the minus sign to negative ions. 

An equivalent definition is 

R = ^ ~*T ’ (25) 

where rj is the distance of the jth ion from the reference ion and R is the near- 
est-neighbor distance. The value given for a will depend on whether it is 
defined in terms of the nearest-neighbor distance R or in terms of the lattice 
parameter a or in terms of some other relevant length. 

As an example, we compute the Madelung constant for the infinite line of 
ions of alternating sign in Fig. 9. Pick a negative ion as reference ion, and let R 
denote the distance between adjacent ions. Then 


« =2 ri _j_ , j__ j, ... 

R l R 2 R 3 R 4 R 


<r = 2 [ 1 “l + 3'"4 + "'_ ; 


the factor 2 occurs because there are two ions, one to the right and one to the 
left, at equal distances r ; . We sum this series by the expansion 


„2 „3 „4 

ln(l+x)=x-|- + |--|- + ' 


Thus the Madelung constant for the one-dimensional chain is a = 2 In 2. 
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Figure 10 Energy per molecule of KCl crystal, showing Madelung (coulomb) and repulsive 
contributions. 


In three dimensions the series presents greater difficulty. It is not 
possible to write down the successive terms by a casual inspection. More 
important, the series will not converge unless the successive terms in the se- 
ries are arranged so that the contributions from the positive and negative 
terms nearly cancel. 

Typical values of the Madelung constant are listed below, based on unit 
charges and referred to the nearest-neighhor distance: 

Structure a 

Sodium chloride, NaCl 1.747565 

Cesium chloride, CsCl 1.762675 

Zinc blende, cubic ZnS 1.6381 

The Madelung and repulsive contributions to the binding of a KCl crystal 
are shown in Fig. 10. Properties of alkali halide crystals having the sodium 
chloride structure are given in Table 7. The calculated values of the lattice en- 
ergy are in exceedingly good agreement with the observed values. 


Table 7 Properties of alkali halide crystals with the NaCl structure 


All values (except those in square brackets) at room temperature and atmospheric pressure, with no correction for changes in R 0 and U from 
absolute zero. Values in square brackets at absolute zero temperature and zero pressure, from private communication by L. Brewer. 



Nearest- 

neighbor 

separation 

R 0 in A 

Bulk modulus B, 
in 10 tl dyn/cin 2 
or 10“ N/m 2 

Repulsive 
energy 
parameter 
zA, in 10 -8 erg 

Repulsive 

range 

parameter 

p , in A 

Lattice energy compared 
to free ions, in kcal/mol 

Experimental 

Calculated 

LiF 

2.014 

6.71 

0.296 

0.291 

242 . 3 [ 246 . 8 ] 

242.2 

LiCl 

2.570 

2.98 

0.490 

0.330 

198 . 9 [ 201 . 8 ] 

192.9 

LiBr 

2.751 

2.38 

0.591 

0.340 

189.8 

181.0 

Lil 

3.000 

( 1 . 71 ) 

0.599 

0.366 

177.7 

166.1 

NaF 

2.317 

4.65 

0.641 

0.290 

214 . 4 [ 217 . 9 ] 

215.2 

NaCl 

2.820 

2.40 

1.05 

0.321 

182 . 6 [ 185 . 3 ] 

178.6 

NaBr 

2.989 

1.99 

1.33 

0.328 

173 . 6 [ 174 . 3 ] 

169.2 

Nal 

3.237 

1.51 

1.58 

0.345 

163 . 21162 . 3 ] 

156.6 

KF 

2.674 

3.05 

1.31 

0.298 

189 . 8 [ 194 , 5 ] 

189.1 

KCl 

3.147 

1.74 

2.05 

0.326 

165 . 8 [ 169 . 5 ] 

161.6 

KBr 

3.298 

1.48 

2.30 

0.336 

158 . 5 [ 159 . 3 ] 

154.5 

KI 

3.533 

1.17 

2.85 

0.348 

149 . 9 [ 151 . 1 ] 

144.5 

RbF 

2.815 

2.62 

1.78 

0.301 

181.4 

180.4 

RbCl 

3.291 

1.56 

3.19 

0.323 

159.3 

155.4 

RbBr 

3.445 

1.30 

3.03 

0.338 

152.6 

148.3 

Rbl 

3.671 

1.06 

3.99 

0.348 

144.9 

139.6 


Data from various tables by M. P. Tosi, Solid State Physics 16. 1 (1964). 
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Figure 11 Calculated valence electron concentration in germanium. The numbers on the con- 
tours give the electron concentration per primitive cell, with four valence electrons per atom 
(eight electrons per primitive cell). Note the high concentration midway along the Ge-Ge bond, 
as we expect for covalent bonding. (After J. R. Chelikowsky and M. L. Cohen.) 


COVALENT CRYSTALS 

The covalent bond is the classical electron pair or homopolar bond of 
chemistry, particularly of organic chemistry. It is a strong bond: the bond be- 
tween two carbon atoms in diamond with respect to separated neutral atoms is 
comparable with the bond strength in ionic crystals. 

The covalent bond is usually formed from two electrons, one from each 
atom participating in the bond. The electrons forming the bond tend to be 
partly localized in the region between the two atoms joined by the bond. The 
spins of the two electrons in the bond are antiparallel. 

The covalent bond has strong directional properties (Fig. 11). Thus car- 
bon, silicon, and germanium have the diamond structure, with atoms joined to 
four nearest neighbors at tetrahedral angles, even though this arrangement 
gives a low filling of space, 0.34 of the available space, compared with 0.74 for 
a close-packed structure. The tetrahedral bond allows only four nearest neigh- 
bors, whereas a close-packed structure has 12. We should not overemphasize 
the similarity of the bonding of carbon and silicon. Carbon gives biology, but 
silicon gives geology and semiconductor technology. 

The binding of molecular hydrogen is a simple example of a covalent bond. 
The strongest binding (Fig. 12) occurs when the spins of the two electrons are 
antiparallel. The binding depends on the relative spin orientation not because 
there are strong magnetic dipole forces between the spins, but because the Pauli 
principle modifies the distribution of charge according to the spin orientation. 
This spin-dependent coulomb energy is called the exchange interaction. 



Figure 12 Energy of molecular hydrogen (H 2 ) referred to separated neutral atoms. A negative 
energy corresponds to binding. The curve N refers to a classical calculation with free atom charge 
densities; A is the result for parallel electron spins, taking the Pauli exclusion principle into ac- 
count, and S (the stable state) for antiparallel spins. The density of charge is represented by con- 
tour lines for the states A and S. 


The Pauli principle gives a strong repulsive interaction between atoms 
with filled shells. If the shells are not filled, electron overlap can be accommo- 
dated without excitation of electrons to high energy states and the bond will be 
shorter. Compare the bond length (2 A) of Cl 2 with the interatomic distance 
(3.76 A) of Ar in solid Ar; also compare the cohesive energies given in Table 1. 
The difference between Cl 2 and Ar 2 is that the Cl atom has five electrons in 
the 3 p shell and the Ar atom has six, filling the shell, so that the repulsive in- 
teraction is stronger in Ar than in Cl. 

The elements C, Si, and Ge lack four electrons with respect to filled 
shells, and thus these elements (for example) can have an attractive interaction 
associated with charge overlap. The electron configuration of carbon is 
ls 2 2s 2 2p 2 . To form a tetrahedral system of covalent bonds the carbon atom 
must first be promoted to the electronic configuration l.s 2 2s2p 3 . This promo- 
tion from the ground state requires 4 eV, an amount more than regained when 
the bonds are formed. 
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Table 8 Fractional ionic character of bonds in binary crystals 


Crystal 

Fractional 
ionic character 

Crystal 

Fractional 
ionic character 

Si 

0.00 



SiC 

0.18 

GaAs 

0.31 

Ge 

0.00 

GaSb 

0.26 

ZnO 

0.62 

AgCl 

0.86 

ZnS 

0.62 

AgBr 

0.85 

ZnSe 

0.63 

Agl 

0.77 

ZnTe 

0.61 

MgO 

0.84 

CdO 

0.79 

MgS 

0.79 

CdS 

0.69 

MgSe 

0.79 

CdSe 

0.70 



CdTe 

0.67 

LiF 

0.92 



NaCl 

0.94 

InP 

0.42 

RbF 

0.96 

InAs 

0.36 



InSb 

0.32 




After J. C. Phillips, Bonds and bands in semiconductors. 


There is a continuous range of crystals between the ionic and the covalent 
limits. It is often important to estimate the extent a given bond is ionic or cova- 
lent. A semiempirical theory of the fractional ionic or covalent character of a 
bond in a dielectric crystal has been developed with considerable success by 
J. C. Phillips, Table 8. 


METALS 

Metals are characterized by high electrical conductivity, and a large num- 
ber of electrons in a metal are free to move about, usually one or two per atom. 
The electrons available to move about are called conduction electrons. The 
valence electrons of the atom become the conduction electrons of the metal. 

In some metals the interaction of the ion cores with the conduction elec- 
trons always makes a large contribution to the binding energy, but the charac- 
teristic feature of metallic binding is the lowering of the energy of the valence 
electrons in the metal as compared with the free atom. 

The binding energy of an alkali metal crystal is considerably less than that 
of an alkali halide crystal: the bond formed by a conduction electron is not very 
strong. The interatomic distances are relatively large in the alkali metals because 
the kinetic energy of the conduction electrons is lower at large interatomic 
distances. This leads to weak binding. Metals tend to crystallize in relatively 
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Figure 13 The hydrogen difluoride ion HFJ 
is stabilized by a hydrogen bond. The sketch 
is of an extreme model of the bond, extreme 
in the sense that the proton is shown bare of 
electrons. 



close packed structures: hep, fee, bcc, and some other closely related structures, 
and not in loosely-packed structures such as diamond. 

In the transition metals there is additional binding from inner electron shells. 
Transition metals and the metals immediately following them in the periodic 
table have large d-electron shells and are characterized by high binding energy. 


HYDROGEN BONDS 

Because neutral hydrogen has only one electron, it should form a covalent 
bond with only one other atom. It is known, however, that under certain condi- 
tions an atom of hydrogen is attracted by rather strong forces to two atoms, 
thus forming a hydrogen bond between them, with a bond energy of the 
order of 0.1 eV. It is believed that the hydrogen bond is largely ionic in charac- 
ter, being formed only between the most electronegative atoms, particularly F, 
O, and N. In the extreme ionic form of the hydrogen bond, the hydrogen atom 
loses its electron to another atom in the molecule; the bare proton forms the 
hydrogen bond. The atoms adjacent to the proton are so close that more than 
two of them would get in each other s way; thus the hydrogen bond connects 
only two atoms (Fig. 13). 

The hydrogen bond is an important part of the interaction between H 2 0 
molecules and is responsible together with the electrostatic attraction of the 
electric dipole moments for the striking physical properties of water and ice. It 
is important in certain ferroelectric crystals and in DNA. 


ATOMIC RADII 

Distances between atoms in crystals can be measured very accurately by 
x-ray diffraction, often to 1 part in 10 5 . Can we say that the observed distance 
between atoms may be assigned partly to atom A and partly to atom B? Can a 
definite meaning be assigned to the radius of an atom or an ion, irrespective of 
the nature and composition of the crystal? 

Strictly, the answer is no. The charge distribution around an atom is not 
limited by a rigid spherical boundary. Nonetheless, the concept of an atomic 




Table 9 Atomic and ionic radii 

Values approximate only. Units are 1 A=10 -10 m. For original references 
see W. B. Pearson, Crystal chemistry and physics of metals and alloys, 
Wiley, 1972. 

uimtftHMJuiiiimMiimimmmnf t 


Mg 

0.65 * Standard radii for ions in inert gas (filled shell) configuration 

1 .40 * — — Radii of atoms when in tetrahedral covalent bonds 

1.60 * Radii of ions in 12-coordinated metals 


N 0 

1.71 1.40 

0.70 0.66 

P S 

2.12 1.84 

1.10 1.04 


Ga 

0.62 

1.26 

1.41 


Kr 

2.00 

# I Iff! 


Se 

1.98 

1.14 


1.64 


In 

0.81 
j 1.44 
1.66 


Xe 

2.17 


Cd 

0.97 

1.48 

1.57 


Sn 

0.71 

1.40 

1.55 


Sb 

2.45 

1.36 

1.59 


Ag 

1.26 

1.52 

1.45 


1.80 


Au 

1.37 

1.44 


Fr 

1.75 


Ra 

1.37 


Ce 

1.01 


Ar 

1.88 


Zn 

0.74 

1.31 

1.39 


Br 

1.95 

1.11 


Rb 

1.48 

Iff If 

2.55 


Cs 

1.67 

2.73 


| J'f f jp 

1.25 


1.35 

1.28 


1.38 


iffif 

1.47 


Tl 

0.95 


1.72 


Pb 

0.84 


1.75 


, 

1.25 


As 

2.22 

1.18 


1.39 
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radius is fruitful in predicting interatomic spacing. The existence and probable 
lattice constants of phases that have not yet been synthesized can be predicted 
from the additive properties of the atomic radii. Further, the electronic config- 
uration of the constituent atoms often can be inferred by comparison of mea- 
sured and predicted values of the lattice constants. 

To make predictions of lattice constants it is convenient to assign (Table 9) 
sets of self-consistent radii to various types of bonds: one set for ionic crystals 
with the constituent ions 6-coordinated in inert gas closed-shell configura- 
tions, another set for the ions in tetrahedrally-eoordinated structures, and an- 
other set for 12-coordinated (close-packed) metals. 

The predicted self-consistent radii of the cation Na + and the anion F - as 
given in Table 9 would lead to 0.97 A + 1.36 A = 2.33 A for the interatomic 
separation in the crystal NaF, as compared with the observed 2.32 A. This 
agreement is much better than if we assume atomic (neutral) configurations 
for Na and F, for this would lead to 2.58 A for the interatomic separation in the 
crystal. The latter value is |(n.n. distance in metallic Na+ interatomic distance 
in gaseous F 2 ). 

The interatomic distance between C atoms in diamond is 1.54 A; one-half 
of this is 0.77 A. In silicon, which has the same crystal structure, one-half the 
interatomic distance is 1.17 A. In SiC each atom is surrounded by four atoms 
of the opposite kind. If we add the C and Si radii just given, we predict 1.94 A 
for the length of the C-Si bond, in fair agreement with the 1.89 A observed for 
the bond length. This is the kind of agreement (a few percent) that we shall 
find in using tables of atomic radii. 

Ionic Crystal Radii 

Table 9 gives the ionic crystal radii in inert gas configurations for 6-fold 
coordination. The ionic radii can be used in conjunction with Table 10. Let us 


Table 10 Use of the standard radii of ions given in Table 9 

The interionic distance D is represented by D N = R c + R A + A v , for ionic crystals, 
where N is the coordination number of the cation (positive ion), R c and R A are the stan- 
dard radii of the cation and anion, and A N is a correction for coordination number. 
Room temperature. (After Zachariasen.) 


N 

A n (A) 

N 

A*(A) 

N 

a«(A) 

1 

-0.50 

5 

-0.05 

9 

+0.11 

2 

-0.31 

6 

0 

10 

+0.14 

3 

-0.19 

7 

+0.04 

11 

+0.17 

4 

-0.11 

8 

+0.08 

12 

+0.19 


1 
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consider BaTiO s with a lattice constant of 4.004 A at room temperature. Each 
Ba ++ ion has 12 nearest 0~~ ions, so that the coordination number is 12 and 
the correction A 12 of Table 10 applies. If we suppose that the structure is 
determined by the Ba-O contacts, we have D 12 = 1.35 + 1.40 + 0.19 = 2.94 A 
or a = 4. 16 A; if the Ti-O contact determines the structure, we have D 6 = 0.68 + 
1.40 = 2.08 or a = 4.16 A. The actual lattice constant is somewhat smaller 
than the estimates and may perhaps suggest that the bonding is not purely 
ionic, but is partly covalent. 


ANALYSIS OF ELASTIC STRAINS 

We consider the elastic properties of a crystal viewed as a homogeneous 
continuous medium rather than as a periodic array of atoms. The continuum 
approximation is usually valid for elastic waves of wavelengths A longer than 
10 6 cm, which means for frequencies below 10 n or 10 12 Hz. Some of the ma- 
terial below looks complicated because of the unavoidable multiplicity of sub- 
scripts on the symbols. The basic physical ideas are simple: we use Hooke s law 
and Newton’s second law. Hooke’s law states that in an elastic solid the strain 
is directly proportional to the stress. The law applies to small strains only. We 
say that we are in the nonlinear region when the strains are so large that 
Hooke s law is no longer satisfied. 

We specify the strain in terms of the components e xx> e yy , e,, 7 e xy , e yz , e zx 
which are defined below. We treat infinitesimal strains only. We shall not 
distinguish in our notation between isothermal (constant temperature) and 
adiabatic (constant entropy) deformations. The small differences between the 
isothermal and adiabatic elastic constants are not often of importance at room 
temperature and below. 

We imagine that three orthogonal vectors x, y, z of unit length are embed- 
ded securely in the unstrained solid, as shown in Fig. 14. After a small uniform 
deformation of the solid has taken place, the axes are distorted in orientation 
and in length. In a uniform deformation each primitive cell of the crystal is 
deformed in the same way. The new axes x\ y', z' may be written in terms of 
the old axes: 

x' = (1 + ejx + e^y + e„z ; 

y' = e yx x + (1 + e yy )y + e yz i ■ (26) 

z' = eJH- e zy y + (1 + ejz . 

The coefficients e ofl define the deformation; they are dimensionless and have 
values ^ 1 if the strain is small. The original axes were of unit length, but the 
new axes will not necessarily be of unit length. For example, 

x' • x' = i + 2e„ + 4 + 4, + 4 > 
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z' 




Figure 14 Coordinate axes for the description of the state of strain; the 
orthogonal unit axes in the unstrained state (a) are deformed in the 
strained state (b). 


whence x' = 1 + H . The fractional changes of length of the x, y, and z 

axes are e^, e yy , e zz , respectively, to the first order. 

What is the effect of the deformation (26) on an atom originally at r = 
xx + yy + zz? The origin is taken at some other atom. If the deformation 
is uniform, then after deformation the point will be at the position 
r' = xx' + yy' + zz'. This is obviously correct if we choose the x axis such that 
r = xx; then r' = xx' by definition of x'. The displacement R of the deforma- 
tion is defined by 

R = r' - r = x(x' - x) + y(y' - y) + z(z' - z) , (27) 

or, from (26), 

R(r) = (x«?„ + ye lJX + zejx + (xe^+ ye m + ze zy )y 

+ (xe xx + ye yz + zeji . ( 2 8) 

This may be written in a more general form by introducing u, v, w such that 
the displacement is given by 


R(r) = u(r)x + v(r)y + w(r)z . 


(29) 


If the deformation is nonuniform we must relate u, v, w to the local strains. We 
take the origin of r close to the region of interest; then comparison of (28) and 
(29) gives, by Taylor series expansion of R using R(0) = 0, 


xe M 



du 




etc. 


(30) 
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It is usual to work with coefficients e a p rather than e aj3 . We define the 
strain components e^, e yy , e zz by the relations 


_ du 

_ _ dv 

dw 

to 5 

e yy~ t yy dy ’ 

(!zz ~ €zl dz ’ 


(31) 


using (30). The other strain components e xy , e yz , e, x are defined in terms of the 
changes in angle between the axes: using (26) we may define 




y z 



+ e. 


■*y 


+ e, 


■yz 


+ £ r - 


_ du . dv 
dy to ; 
_ dv , dw 
dz dy ’ 
_ du , dtp 
dz to ' 


(32) 


We may replace the = signs by = signs if we neglect terms of order e 2 . The six 
dimensionless coefficients e a p(=ep a ) completely define the strain. 

Dilation 

The fractional increase of volume associated with a deformation is called 
the dilation. The dilation is negative for hydrostatic pressure. The unit cube of 
edges x, y, z has a volume after deformation of 

V' = x' • y' X z' , (33) 

by virtue of a well-known result for the volume of a parallelepiped having 
edges x', y', z'. From (26) we have 


x' • y' X z' = 


1 + e x 


*-xy xz 

1 ^yy ^yz 

f "f 


= 1 + * e yy + e z 


(34) 


Products of two strain components have been neglected. The dilation S is then 
given by 



+ e, 


yy 


+ 


(35) 


Stress Components 

The force acting on a unit area in the solid is defined as the stress. There 
are nine stress components: X r> X y , X z , Y r , Y y , Y,, Z r , Z y , Z z . The capital letter 
indicates the direction of the force, and the subscript indicates the normal to 
the plane to which the force is applied. In Fig. 15 the stress component X x 
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Figure 15 Stress component X x is a force applied in the 
x direction to a unit area of a plane whose normal lies in 
the x direction; X y is applied in the x direction to a unit 
area of a plane whose normal lies in the y direction. 



Figure 16 Demonstration that for a body in static equilibrium 
Y, = X„. The sum of the forces in the i direction is zero. The sum 
of the forces in the y direction is also zero. The total force vanishes. 
The total torque about the origin is also zero if Y x = X y . 



represents a force applied in the x direction to a unit area of a plane whose 
normal lies in the x direction; the stress component X y represents a force 
applied in the x direction to a unit area of a plane whose normal lies in the y 
direction. The number of independent stress components is reduced from 
nine to six by applying to an elementary cube (as in Fig. 16) the condition that 
the angular acceleration vanish, and hence that the total torque must be zero. 
It follows that 


Y z = Z y ; Z t = X z ; Xy = Y z . (36) 

The six independent stress components may be taken as X,, Y y , Z z , Y., Z x , X y . 

Stress components have the dimensions of force per unit area or 
energy per unit volume. The strain components are ratios of lengths and are 
dimensionless. 
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ELASTIC COMPLIANCE AND STIFFNESS CONSTANTS 

Hooke's law states that for sufficiently small deformations the strain is di- 
rectly proportional to the stress, so that the strain components are linear func- 
tions of the stress components: 

= s n x r + s 12 Y y + s 13 z z + s 14 y z + s 15 z, + s 16 Xj, ; 

e yy = ^ 21 -^t d" S 2 2 Yy + $23^ + 824 Y z d~ l 

e zz = ^31-^r Y S 32 Y,, + S 33 Z, Y S^Y-j + S. 35 Z z + S^Ky ; 

e yz = Y S^Y, Y StfZz + S 44 Y, Y S,, 5 Z r + Sjfjky ; 

Csr = SjiXj + S 52 f J Y S 53 Z z + ‘S^Y, + S^Z* + SggXy ; 

e xy = Sei-^r + + S(aZ. + S^Y, + .S ft gZ r + S 66 X y . 

Y* 1' 1 l^rx Y C 12pyy Y C 13 0 zz ^ ^I4^yz~^~ f ' ! r/Yr Y C ’ | ffixy > 

Y„ — f' 2 lC rr Y f^ 22 ^yyd" Co.yy. ^ ^24^yz~^~ ^-''25^zx d~ ^2t^xy > 

Z z ~ f '.31^11 d- C^yyY 1^33^22 d- C 34 e, fZ Y f'3gC„ Y C^ X y ; 

Y. f ' 4 Yl.r Y Cq^yy Y 1^43^22 Y ^'44^,^- Y f'.-igC,, 1 C 4(j&xy , 

Z T Y ' f^53^zzd" f Y 4 1 ^5(fixy 5 

= C 61 e a Y C 62 e ra Y C63e zz + C^e^Y Cgge^Y C^e xy . 

The quantities S n , S 12 ■ ■■ are called elastic compliance constants or 
elastic constants; the quantities C u , C 12 , . . . are called the elastic stiffness 
constants or moduli of elasticity. The S’s have the dimensions of [area]/ 
[force] or [volume ]/[energy] . The C’s have the dimensions of [ force ]/[ area] or 
[energy]/[ volume] . 

Elastic Energy Density 

The 36 constants in (37) or in (38) may be reduced in number by several 
considerations. The elastic energy density U is a quadratic function of the 
strains, in the approximation of Hooke’s law (recall the expression for the energy 
of a stretched spring). Thus we may write 

0=4 (39) 

Z A = 1 1 

where the indices 1 through 6 are defined as: 

1 = XX \ 2 = yy ; 3 = ZZ ; 4 = ijz ; 5 = zx ; 6 = xy . (40) 

The C’s are related to the C’s of (38), as in (42) below. 


(37) 


(38) 


The stress components are found from the derivative of U with respect to 
the associated strain component. This result follows from the definition of 
potential energy. Consider the stress X, applied to one face of a unit cube, the 
opposite face being held at rest: 

*-£-g- e ”- + 5£< e » + C *» ' (41) 

Note that only the combination |(C a p + Cp a ) enters the stress-strain relations. 
It follows that the elastic stiffness constants are symmetrical: 


— z(C a 8 — Cp a . 


(42) 


Thus the thirty-six elastic stiffness constants are reduced to twenty-one. 


Elastic Stiffness Constants of Cubic Crystals 

The number of independent elastic stiffness constants is reduced further 
if the crystal possesses symmetry elements. We now show that in cubic crystals 
there are only three independent stiffness constants. 

We assert that the elastic energy density of a cubic crystal is 

U = kC u (ei + 4, + e%) + £C M (e^ + 4 + e%) + + e^ yy ) , ( 43 ) 

and that no other quadratic terms occur; that is, 

+ "■) ; (V^ + "') ; + ' ") (44) 

do not occur. 

The minimum symmetry requirement for a cubic structure is the exis- 
tence of four three-fold rotation axes. The axes are in the [111] and equivalent 
directions (Fig. 17). The effect of a rotation of 27r/3 about these four axes is to 
interchange the x, y, z axes according to the schemes 


x~+z->-y^x ; -r ->y->z->~x , 
according to the axis chosen. Under the first of these schemes, for exampl 

“h P^ — > P^ 4- p 2 -|- (£ 

Vyy ' ^zz 1 > 


(45) 

e, 


and similarly for the other terms in parentheses in (43). Thus (43) is invariant 
under the operations considered. But each of the terms exhibited in (44) is 
odd in one or more indices. A rotation in the set (45) can be found which will 
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change the sign of the term, because e XIJ = — for example. Thus the 
terms (44) are not invariant under the required operations. 

It remains to verify that the numerical factors in (43) are correct. By (41), 

dU/de tt = X x = Cne^ + C ]2 (e w + e- z ) . (46) 

The appearance of C u e CT agrees with (38). On further comparison, we see that 

Ci2 = Ci 3 ; C l4 = C 15 = C 16 = 0 . (47) 

Further, from (43), 


dU/d€ x j Xy C , 

on comparison with (38) we have 


^61 — ^62 — 0(53 — Cf 54 — C 6 5 — 0 ; 


Cm ~ C± 


(48) 


(49) 


Thus from (43) we find that the array of values of the elastic stiffness 
constants is reduced for a cubic crystal to the matrix 



e xx 

e yy 

e zz 

e y-~ 


&xtj 

X, 

C n 

C 12 

C 12 

0 

0 

0 

X, 

C*12 

Cn 

C X2 

0 

0 

0 

Z z 

C \2 

Ci2 

Cn 

0 

0 

0 

Y. 

0 

0 

0 

C44 

0 

0 

Z, 

0 

0 

0 

0 

C 44 

0 


0 

0 

0 

0 

0 

C44 


( 50 ) 




For cubic crystals the stiffness and compliance constants are related by 

C 14 = I/S 44 ; C n — C 12 = (Su — S 12 ) 1 ; 

C n + 2 C 12 = (S„ + 2 S 12 ) _1 . (51) 

These relations follow on evaluating the inverse matrix to (50). 

Bulk Modulus and Compressibility 

Consider the uniform dilation e xx = e yy = e zz = |S. For this deformation the 
energy density (43) of a cubic crystal is 


t/ = |(C u + 2C 12 )S 2 . (52) 

We may define the bulk modulus B by the relation 

U = lBS 2 , (53) 

which is equivalent to the definition —V dp/dV. For a cubic crystal, 

B = 3 (C n + 2C 12 ) . (54) 

The compressibility K is defined as K = 1/B. Values of B and K are given in 
Table 3. 


ELASTIC WAVES IN CUBIC CRYSTALS 


By considering as in Figs. 18 and 19 the forces acting on an element of 
volume in the crystal we obtain the equation of motion in the x direction 


= ^ ^ ax, 

P dt 2 dx dy dz ’ 


(55) 


here p is the density and u is the displacement in the x direction. There are 
similar equations for the y and z directions. From (38) and (50) it follows that 
for a cubic crystal 


d 2 u 

at 2 


n'L J£=r ZZJ5.+ C 
P ,.2 u n + °12 


ggg 

dx 


d e yy de zz 
dx dx 


de x 

+ C44 I + 

' dy dz 


"xy de. 


(56) 


here the x, y, z directions are parallel to the cube edges. Using the definitions 
(31) and (32) of the strain components we have 


■ — = c„^ + c^(^+ ° 2u 


dt 2 


dx 2 


dy dz 


+ (C 12 + C 44 )I^-+ d * w 


dx dy dx dz 


(57a) 


where u, v, w are the components of the displacement R as defined by (29). 
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Volume Ax Ay A z 


~X x (x) 


Figure 18 Cube of volume Ax Ay Az acted 
on by a stress — X r (x) on the face at x, and 

ax, 

X,(x + Ax) — X,(x) + -j— Ax on the parallel 

face at x + Ax. The net force is 

Ay Az. Other forces in the x direction 

arise from the variation across the cube of 
the stresses Xy and X z , which are not shown. The 
net x component of the force on the cube is 



(dX, «, BX\ 

F -=Ur + i^ + ^-J A * A !' 


Ax 


The force equals the mass of the cube times 
the component of the acceleration in the x 
direction. The mass is p Ax Ay Az, and the 
acceleration is d 2 u/dt 2 . 






177 ;;: 






A 

— 

^ B 



' " 


UMh- 


Figure 19 If springs A and B are stretched equally, the block between them experiences no net 
force. This illustrates the fact that a uniform stress X, in a solid does not give a net force on a vol- 
ume element. If the spring at B is stretched more than the spring at A, the block between them 
will be accelerated by the force X x (B) — XjA). 


The corresponding equations of motion for d 2 v/dt 2 and d 2 w/dt 2 are found 
directly from (57a) by symmetry: 


dt 2 

d 2 w 
P 1? 




d 2 V _ ^ d 2 V , „ / d 2 V , d 2 V i , , n ,r-\ 
P TZ _ C 11 2 + l o + , o I + (C 12 + C 44 ) 


_ / . d tv 1 s, 

~ '-'11 - o ' 1 " °44 


dx 2 dz 2 

+ ) + (C 12 + C 44 ) 

dx dy 


/ du 

. d 2 w 

\dxdy 

dy dz 


d 2 v 


d 2 u 

dx dz dy dz 


(57b) 


(57c) 


We now look for simple special solutions of these equations. 


Waves in the [100] Direction 

One solution of (57a) is given by a longitudinal wave 


u =u 0 exp [i(Kr — <ut)] , (58) 

where u is the x component of the particle displacement. Both the wavevector 
and the particle motion are along the x cube edge. Here K = is the 


wavevector and w — 2 ttv is the angular frequency. If we substitute (58) into 
(57a) we find 

co 2 p = C U K? ■ ( 59 ) 

thus the velocity w/K of a longitudinal wave in the [100] direction is 

v s =v\= w/K = (C n /p) m . (60) 

Consider a transverse or shear wave with the wavevector along the x cube 
edge and with the particle displacement v in the y direction: 

v = u 0 exp [i(Kx — cat)] . (61) 

On substitution in (57b) this gives the dispersion relation 

w 2 p = ■ (62) 

thus the velocity w/K of a transverse wave in the [100] direction is 

t>. = (CWP) 1 * ■ (63) 

The identical velocity is obtained if the particle displacement is in the 2 direc- 
tion. Thus for K parallel to [100] the two independent shear waves have equal 
velocities. This is not true for K in a general direction in the crystal. 

Waves in the [110] Direction 

There is a special interest in waves that propagate in a face diagonal direc- 
tion of a cubic crystal, because the three elastic constants can be found simply 
from the three propagation velocities in this direction. 

Consider a shear wave that propagates in the xy plane with particle dis- 
placement w in the z direction 

w = w 0 exp [i(KjX + Kyy — wt)] , (64) 

whence (32c) gives 

< n 2 p=C 44 (^ + ^) = C 44 K 2 , (65) 

independent of propagation direction in the plane. 

Consider other waves that propagate in the xy plane with particle motion 
in the xy plane: let 

u =u 0 exp [i(Kp: + Kyij — wt)] ; v = v 0 exp [i(KjX + K^y — wt)] . ( 66 ) 

From (57a) and (57b), 


w z pu = ( C n K 2 x + C 44 K z )u + (C 12 + CJKcKyV ; 
w 2 pv = (C u K 2 y +C u K*)v + (C 12 + CuKKyU . 


( 67 ) 
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This pair of equations has a particularly simple solution for a wave in the [110] 
direction, for which K t = K, y = K/V 2 . The condition for a solution is that the 
determinant of the coefficients of u and v in (67) should equal zero: 

— w 2 p + 1 (C u + C 44 )K 2 
|(C 12 + C 44 )K 2 

This equation has the roots 

6i 2 p = |(Ci 1 + C 1 2 + 2C 44 )K 2 ; (o z p = i(C n -C 12 )K 2 . (69) 

The first root describes a longitudinal wave; the second root describes a 
shear wave. How do we determine the direction of particle displacement? The 
first root when substituted into the upper equation of (67) gives 

§(C„ + C 12 + 2C 44 )K 2 u = !(C U + Cjiflu + | (C 12 + C 44 )1 C 2 u , (70) 

whence the displacement components satisfy u = v. Thus the particle dis- 
placement is along [110] and parallel to the K vector (Fig. 20). The second 
root of (44) when substituted into the upper equation of (67) gives 

l(Cn - C 12 )K 2 u = |(C U + CJK.% + | (Cu + CjK*v , (71) 

whence u = —v. The particle displacement is along [llO] and perpendicular to 
the K vector. 

Selected values of the adiabatic elastic stiffness constants of cubic crystals 
at low temperatures and at room temperature are given in Table 11. Notice the 
general tendency for the elastic constants to decrease as the temperature is in- 
creased. Further values at room temperature alone are given in Table 12. 




2(^12 T C ^ K " 
-a> 2 p+i(C u + C u )K 2 


= 0 


( 68 ) 



Wave in [ 100 ] direction 
L:C 11 
T : C44 



\k 


Wave in [ 110 ] direction 
L ■ |(^u + ^12 + 2C44) 
Ti :C« 

7V|(C u -C 12 ) 



T 


Wave in [ 111 ] direction 
E • + 2C 4 2 + 4C44) 

T '■ l(C]i - C 12 + C44) 


Figure 20 Effective elastic constants for the three modes of elastic waves in the principal propa- 
gation directions in cubic crystals. The two transverse modes are degenerate for propagation in 
the [100] and [111] directions. 
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Table 11 Adiabatic elastic stiffness constants of cubic crystals 
at low temperature and at room temperature 

The values given at 0 K were obtained by extrapolation of measurements carried out 
down to 4 K. The table was compiled with the assistance of Professor Charles S. Smith. 


Crystal 


Stiffness constants, in 10 12 dyne/cm 2 ( 10 n N/m 2 ) 

Density, g/cm 3 

Cn 

•7 12 

C 44 

Temperature, K 

W 

5.326 

2.049 

1.631 

0 

19.317 


5.233 

2.045 

1.607 

300 

— 

Ta 

2.663 

1.582 

0.874 

0 

16.696 


2.609 

1.574 

0.818 

300 

— 

Cu 

1.762 

1.249 

0.818 

0 

9.018 


1.684 

1.214 

0.754 

300 

— 

A g 

1.315 

0.973 

0.511 

0 

10.635 


1.240 

0.937 

0.461 

300 

— 

Au 

2.016 

1.697 

0.454 

0 

19.488 


1.923 

1.631 

0.420 

300 

— 

A1 

1.143 

0.619 

0.316 

0 

2.733 


1.068 

0.607 

0.282 

300 

— 

K 

0.0416 

0.0341 

0.0286 

4 



0.0370 

0.0314 

0.0188 

295 


Pb 

0.555 

0.454 

0.194 

0 

11.599 


0.495 

0.423 

0.149 

300 

— 

Ni 

2.612 

1.508 

1.317 

0 

8.968 


2.508 

1.500 

1.235 

300 

— 

Pd 

2.341 

1.761 

0.712 

0 

12.132 


2.271 

1.761 

0.717 

300 

— 


Table 12 Adiabatic elastic stiffness constants of several 
cubic crystals at room temperature or 300 K 



Stiffness constants, in 10 12 dyne/cm 2 or 10 11 N/m 2 

c„ 

C i2 

c« 

Diamond 

10.76 

1.25 

5.76 

Na 

0.073 

0.062 

0.042 

Li 

0.135 

0.114 

0.088 

Ge 

1.285 

0.483 

0.680 

Si 

1.66 

0.639 

0.796 

GaSb 

0.885 

0.404 

0.433 

InSb 

0.672 

0.367 

0.302 

MgO 

2.86 

0.87 

1.48 

NaCl 

0.487 

0.124 

0.126 
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There are three normal modes of wave motion in a crystal for a given 
magnitude and direction of the wavevector K. In general, the polarizations 
(directions of particle displacement) of these modes are not exactly parallel or 
perpendicular to K. In the special propagation directions [100], [111], and 
[110] of a cubic crystal two of the three modes for a given K are such that the 
particle motion is exactly transverse to K and in the third mode the motion 
is exactly longitudinal (parallel to K). The analysis is much simpler in these 
special directions than in general directions. 


SUMMARY 

• Crystals of inert gas atoms are bound by the van der Waals interaction (in- 
duced dipole-dipole interaction), and this varies with distance as 1/R 6 . 

• The repulsive interaction between atoms arises generally from the electro- 
static repulsion of overlapping charge distributions and the Pauli principle, 
which compels overlapping electrons of parallel spin to enter orbitals of 
higher energy. 

• Ionic crystals are bound by the electrostatic attraction of charged ions of 
opposite sign. The electrostatic energy of a structure of 2 N ions of charge 
±q is 


(CGS) U = -Najr = -N £ —r~ , 

it ' ij 

where a is the Madelung constant and R is the distance between nearest 
neighbors. 

• Metals are bound by the reduction in the kinetic energy of the valence elec- 
trons in the metal as compared with the free atom. 

• A covalent bond is characterized by the overlap of charge distributions of 
antiparallel electron spin. The Pauli contribution to the repulsion is reduced 
for antiparallel spins, and this makes possible a greater degree of overlap. 
The overlapping electrons bind their associated ion cores by electrostatic 
attraction. 


Problems 

1. Quantum solid. In a quantum solid the dominant repulsive energy is the zero- 
point energy of the atoms. Consider a crude one-dimensional model of crystalline 
He 4 with each He atom confined to a line segment of length L. In the ground state 
the wave function within each segment is taken as a half wavelength of a free parti- 
cle. Find the zero-point kinetic energy per particle. 


2 . Cohesive energy ofbcc and fee neon. Using the Lennard-Jones potential, cal- 
culate the ratio of the cohesive energies of neon in the bee and fee structures (Ans. 
0,958). The lattice sums for the bee structures are 

2' p*" 12 = 9.11418 ; 2' P*' 6 = 12.2533 . 

j i 

3. Solid molecular hydrogen. For H 2 one finds from measurements on the gas that 
the Lennard-Jones parameters are e = 50 X 10 -16 erg and it = 2.96 A. Find the 
cohesive energy in kj per mole of H 2 ; do the calculation for an fee structure. 
Treat each II 2 molecule as a sphere. The observed value of the cohesive energy is 
0.751 kj/mol, much less than we calculated; thus, quantum corrections must be 
very important. 

4 . Possibility of ionic crystals R + R~. Imagine a crystal that exploits for binding the 
coulomb attraction of the positive and negative ions of the same atom or molecule 
R. This is believed to occur with certain organic molecules, but it is not found 
when R is a single atom. Use the data in Tables 5 and 6 to evaluate the stability of 
such a form of Na in the NaCl structure relative to normal metallic sodium. Evalu- 
ate the energy at the observed interatomic distance in metallic sodium, and use 
0.78 eV as the electron affinity of Na. 

5. Linear ionic crystal. Consider a line of 2 N ions of alternating charge ±q with a 
repulsive potential energy A/R" between nearest neighbors, (a) Show that at the 
equilibrium separation 

2Afa 2 ln2/ 

(CCS) WW-— • 


(b) Let the crystal be compressed so that fi(,— >Ro(l ~ 5)- Show that the work done 
in compressing a unit length of the crystal has the leading term |CS 2 , where 


(CCS) 


(n — l)q 2 In 2 

*0 


To obtain the results in SI, replace if by </ 2 /4ire„. Note: We should not expect to ob- 
tain this result from the expression for U(R 0 ), but we must use the complete expres- 
sion for U(R). 

6. Cubic ZnS structure. Using A and p from Table 7 and the Madelung constants 
given in the text, calculate the cohesive energy of KCl in the cubic ZnS structure 
described in Chapter 1. Compare with the value calculated for KCl in the NaCl 
structure. 


7. Divalent ionic crystals. Barium oxide has the NaCl structure. Estimate the 
cohesive energies per molecule of the hypothetical crystals Ba + 0~ and Ba ++ 0 
referred to separated neutral atoms. The observed nearest-neighbor intemuclear 
distance is R,, = 2.76 A; the first and second ionization potentials of Ba are 5.19 
and 9.96 eV; and the electron affinities of the first and second electrons added 
to the neutral oxygen atom are 1.5 and —9.0 eV. The first electron affinity of the 
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Tension 



Figure 21 Youngs modulus is defined as stress/strain for a ten- 
sile stress acting in one direction, with the specimen sides left 
free. Poissons ratio is defined as (8w/w)/(8l/l) for this situation. 


y 


X 


Undeformed body 


7 


+ 


Figure 22 This deformation is compounded 
from the two shears e a = ~e yy . 


neutral oxygen atom is the energy released in the reaction O + e — * O - . The sec- 
ond electron affinity is the energy released in the reaction O - + e —* O . Which 
valence state do you predict will occur? Assume R 0 is the same for both forms, and 
neglect the repulsive energy. 

8. Young’s modulus and Poisson’s ratio. A cubic crystal is subject to tension in the 
[100] direction. Find expressions in terms of the elastic stiffnesses for Youngs 
modulus and Poisson's ratio as defined in Fig. 21. 

9. Longitudinal wave velocity. Show that the velocity of a longitudinal wave in the 
[111] direction of a cubic crystal is given by v x — [|(C U + 2 C 12 + 4 C 44 )/p] yi . Hint: 
For such a wave u = v = u>. Let « = u 0 e ,K<I+! ' +z,/v/ ^e _, “ , , and use Eq. (57a). 

10 . Transverse wave velocity. Show that the velocity of transverse waves in the [111] 
direction of a cubic crystal is given by v s = [|(C 11 - C 12 + C^/p] 1 ®. Hint: See 
Problem 9. 

11 . Effective shear constant. Show that the shear constant |(C 11 — C n ) in a cubic 

crystal is defined by setting e a = — = \e and all other strains equal to zero, as in 

Fig. 22. Hint: Consider the energy density (43); look for a C' such that U = §C'e 2 . 

12 . Determinantal approach. It is known that an R-dimensional square matrix with 
all elements equal to unity has roots R and 0, with the R occurring once and the 
zero occurring R — 1 times. If all elements have the value p, then the roots are 
Rp and 0. (a) Show that if the diagonal elements are q and all other elements are 
p, then there is one root equal to (R — l)p + q and R — 1 roots equal to q — p. 
(b) Show from the elastic equation (57) for a wave in the [111] direction of a cubic 
crystal that the determinantal equation which gives m 2 as a function of K is 


q — arp p 

p q — 0 ) 2 p 

V P 


P 

P 

q-w 2 p 


= 0 


where q = ^fC 2 (C u + 2C 44 ) and p = 5 K 2 (C 12 + C 44 ). This expresses the condition 
that three linear homogeneous algebraic equations for the three displacement 



components u , v, w have a solution. Use the result of part (a) to find the three 
roots of a> 2 ; check with the results given for Problems 9 and 10. 

13. General propagation direction, (a) By substitution in (57) find the determinan- 
tal equation which expresses the condition that the displacement 

R(r) = [«,)& + r 0 y + moz] exp [i(K • r - tef)] 

be a solution of the elastic wave equations in a cubic crystal, (b) The sum of the 
roots of a determinantal equation is equal to the sum of the diagonal elements a u . 
Show from part (a) that the sum of the squares of the three elastic wave velocities 
in any direction in a cubic crystal is equal to (Cji + 2C 4i )/p. Recall that vf = ojVK 2 . 

14. Stability criteria. The criterion that a cubic crystal with one atom in the primi- 
tive cell be stable against small homogeneous deformations is that the energy den- 
sity (43) be positive for all combinations of strain components. What restrictions 
are thereby imposed on the elastic stiffness constants? (In mathematical language 
the problem is to find the conditions that a real symmetric quadratic form should 
be positive definite. The solution is given in books on algebra; see also Korn and 
Korn, Mathematical Handbook, McGraw-Hill, 1961, Sec. 13.5-6.) Ans. C 44 > 0, 
C n > 0, C'l! - Cf 2 > 0, and C u + 2C 12 > 0. For an example of the instability 
which results when Cn = C 12 , see L. R. Testardi et al., Phys. Rev. Letters 15, 
250 (1965). 
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Chapter 5 treats the thermal properties of phonons. 




Name 

Field 


Electron 

— 


Photon 

Electromagnetic wave 



Phonon 

Elastic wave 

Ill 

Plasmon 

Collective electron wave 



Magnon 

Magnetization wave 

- 

Polaron 

Electron + elastic deformation 

- 

Exeiton 

Polarization wave 


Figure 1 Important elementary excitations in solids. 


Figure 2 (Dashed lines) Planes of atoms 
when in equilibrium. (Solid lines) Planes 
of atoms when displaced as for a longitudi- 
nal wave. The coordinate u measures the 
displacement of the planes. 
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Figure 3 Planes of atoms as displaced during 
passage of a transverse wave. 
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CHAPTER 4: PHONONS I. CRYSTAL VIRRATIONS 


VIBRATIONS OF CRYSTALS WITH MONATOMIC BASIS 

Consider the elastic vibrations of a crystal with one atom in the primitive 
cell. We want to find the frequency of an elastic wave in terms of the wavevec- 
tor that describes the wave and in terms of the elastic constants. 

The mathematical solution is simplest in the [100], [110], and [111] propa- 
gation directions in cubic crystals. These are the directions of the cube edge, 
face diagonal, and body diagonal. When a wave propagates along one of these 
directions, entire planes of atoms move in phase with displacements either 
parallel or perpendicular to the direction of the wavevector. We can describe 
with a single coordinate u s the displacement of the plane s from its equilibrium 
position. The problem is now one dimensional. For each wavevector there are 
three modes as solutions for u s , one of longitudinal polarization (Fig. 2) and 
two of transverse polarization (Fig. 3). 

We assume that the elastic response of the crystal is a linear function of 
the forces. That is equivalent to the assumption that the elastic energy is a 
quadratic function of the relative displacement of any two points in the crystal. 
Terms in the energy that are linear in the displacements will vanish in 
equilibrium — see the minimum in Fig. 3.6. Cubic and higher-order terms may 
be neglected for sufficiently small elastic deformations. 

We assume that the force on the plane s caused by the displacement of the 
plane s + p is proportional to the difference u s+p — u s of their displacements. 
For brevity we consider only nearest-neighbor interactions, with p = ±1. The 
total force on s from planes s ± 1: 

F s = C(u s+ 1 - u s ) + C(n s _, - u s ) . (1) 

This expression is linear in the displacements and is of the form of Hooke’s law. 

The constant C is the force constant between nearest-neighbor planes 
and will differ for longitudinal and transverse waves. It is convenient hereafter 
to regard C as defined for one atom of the plane, so that F s is the force on one 
atom in the plane s. 

The equation of motion of an atom in the plane s is 

M = C(u s+1 + u s _! - 2 u s ) , (2) 

where M is the mass of an atom. We look for solutions with all displacements 
having the time dependence exp(— i<ot). Then cPuJdt 2 = — m 2 m s , and (2) becomes 

-Ma> 2 u s = C(u s+ i + u s _! - 2u s ) . (3) 
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This is a difference equation in the displacements u and has traveling 
wave solutions of the form: 

u s± i = u exp(isKa) exp{± iKa) , (4) 

where a is the spacing between planes and K is the wavevector. The value to 
use for a will depend on the direction of K. 

With (4), we have from (3): 

-o) 2 Mu exp (isKa) = Cu{exp[i(s + l)Ka] + exp[i(s — l)Ka] — 2 exp(isKa)) . (5) 

We cancel u exp (isKa) from both sides, to leave 

a ) 2 M = — Cfexp(iKh) + exp(— iKa) — 2] . (6) 

With the identity 2 cos Ka — exp (iKa) + exp (-iKa), we have the dispersion 
relation ct)(K). 

w 2 = (2C/M)(1 - cos Ka) . (7) 

The boundary of the first Brillouin zone lies at K = ±ir/a. We show from 
(7) that the slope of < o versus K is zero at the zone boundary: 

dxo 2 /dK = (2 Ca/M) sin Ka = 0 (8) 

at K = ±7 r/a, for here sin Ka = sin (±tt) = 0. The special significance of 
phonon wavevectors that lie on the zone boundary is developed in (12) below. 
By a trigonometric identity, (7) may be written as 

ft> 2 = (4 C/M) sin 2 I Ka ; ft) = (4C/M) 1 ' 2 |sin | Ka | . (9) 

A plot of ft> versus K is given in Fig. 4. 
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First Brillouin Zone 


What range of K is physically significant for elastic waves? Only those in 
the first Brillouin zone. From (4) the ratio of the displacements of two succes- 
sive planes is given by 


u s+l u exp[t(s + l)Ka] 


= exp (iKa) . 


( 10 ) 


u s u exp (isKa) 

The range — tt to +v for the phase Ka covers all independent values of the 
exponential. 

The range of independent values of K is specified by 


— 7 t < Ka< it , 


or 


< K < — 
a a 


This range is the first Brillouin zone of the linear lattice, as defined in 
Chapter 2. The extreme values are K max = ±7r/a. Values of K outside of the 
first Brillouin zone (Fig. 5) merely reproduce lattice motions described by 
values within the limits ±v/a. 

We may treat a value of K outside these limits by subtracting the integral 
multiple of 2v/a that will give a wavevector inside these limits. Suppose K lies out- 
side the first zone, but a related wavevector K' defined K' = K — Zim/a lies within 
the first zone, where n is an integer. Then the displacement ratio (10) becomes 


u s+1 /u s = exp(iKfl) = exp(i27m) exp[i(Ka - 27m)] = exp(iK'u) , (11) 

because exp(i27m) = 1. Thus the displacement can always be described by a 
wavevector within the first zone. We note that 27 m/a is a reciprocal lattice vec- 
tor because 2tt/o is a reciprocal lattice vector. Thus by subtraction of an appro- 
priate reciprocal lattice vector from K , we always obtain an equivalent 
wavevector in the first zone. 

At the boundaries K max = ±v/a of the Brillouin zone the solution u s = 
u exp (isKa) does not represent a traveling wave, but a standing wave. At the 
zone boundaries sK mxx a = ±«7r, whence 


u s = u exp(± isTr) = u (— l) s . 


( 12 ) 



Figure 5 The wave represented by the solid curve conveys no information not given by the 
dashed curve. Only wavelengths longer than 2a are needed to represent the motion. 



This is a standing wave: alternate atoms oscillate in opposite phases, because 
u s = ±1 according to whether s is an even or an odd integer. The wave moves 
neither to the right nor to the left. 

This situation is equivalent to Bragg reflection of x-rays: when the Bragg 
condition is satisfied a traveling wave cannot propagate in a lattice, but 
through successive reflections back and forth, a standing wave is set up. 

The critical value K max = ±irla found here satisfies the Bragg condition 
2d sin 9 = n\: we have 9 = |t t, d = a,K = 2tt/A, n = 1, so that A = 2a. With 
x-rays it is possible to have n equal to other integers besides unity because the 
amplitude of the electromagnetic wave has a meaning in the space between 
atoms, but the displacement amplitude of an elastic wave usually has a mean- 
ing only at the atoms themselves. 

Group Velocity 

The transmission velocity of a wave packet is the group velocity, given as 

v g - dcj/dK , 
or 

v g = grad K (K) , (13) 

the gradient of the frequency with respect to K. This is the velocity of energy 
propagation in the medium. 

With the particular dispersion relation (9), the group velocity (Fig. 6) is 

= (Ca 2 /M) m cos ^ Ka . (14) 

This is zero at the edge of the zone where K = 77/a. Here the wave is a standing 
wave, as in (12), and we expect zero net transmission velocity for a standing wave. 

Long Wavelength Limit 

When Ka 1 we expand cos Ka = 1 — |(Ka) 2 , so that the dispersion rela- 
tion (7) becomes 

M 2 = (C/M)K?a 2 . (15) 

The result that the frequency is directly proportional to the wavevector in the 
long wavelength limit is equivalent to the statement that the velocity of sound 
is independent of frequency in this limit. Thus v = (o/K, exactly as in the con- 
tinuum theory of elastic waves — in the continuum limit Ka < 1. 

Derivation of Force Constants from Experiment 

In metals the effective forces may be of quite long range and are carried 
from ion to ion through the conduction electron sea. Interactions have been 
found between planes of atoms separated by as many as 20 planes. We can make 
a statement about the range of the forces from the observed experimental 
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Figure 6 Group velocity versus K, for model of 
Fig. 4. At the zone boundary K = ir/a the group 
velocity is zero. 


dispersion relation for co. The generalization of the dispersion relation (7) top 
nearest planes is easily found to be 

co 2 = (2/M) 2 C_(l - cos pKa) . (16a) 

p>0 

We solve for the interplanar force constants C p by multiplying both sides 
by cos rKa , where r is an integer, and integrating over the range of indepen- 
dent values of K: 

r irla C iria 

M I dK ca| cos rKa = 2 2 C p I dK (1 — cos pKa) cos rKa 

J —it! a p> 0 J -irla 

= —2t TC r /a . 

The integral vanishes except for p — r. Thus 

C p = — f dK w| cos pKa 

ATT J —Tt/a 

gives the force constant at range pa, for a structure with a monatomic basis. 


(16b) 

(17) 


TWO ATOMS PER PRIMITIVE BASIS 

The phonon dispersion relation shows new features in crystals with two or 
more atoms per primitive basis. Consider, for example, the NaCl or diamond 
structures, with two atoms in the primitive cell. For each polarization mode in 
a given propagation direction the dispersion relation co versus K develops two 
branches, known as the acoustical and optical branches, as in Fig. 7. We have 
longitudinal LA and transverse acoustical TA modes, and longitudinal LO and 
transverse optical TO modes. 

If there are p atoms in the primitive cell, there are 3 p branches to the dis- 
persion relation: 3 acoustical branches and 3p - 3 optical branches. Thus ger- 
manium (Fig. 8a) and KBr (Fig. 8b), each with two atoms in a primitive cell, 
have six branches: one LA, one LO, two TA, and two TO. 
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Figure 7 Optical and acoustical branches of the dis- 
persion relation for a diatomic linear lattice, showing 
the limiting frequencies at K = 0 and K = = ir/a. 

The lattice constant is a. 





Figure 8a Phonon dispersion relations in the [111] 
direction in germanium at 80 K. The two TA phonon 
branches are horizontal at the zone boundary position, 
= (2ir/a)(Hf). The LO and TO branches coincide at 
K = 0; this also is a consequence of the crystal symmetry 
of Ge. The results were obtained with neutron inelastic 
scattering by G, Nilsson and G. Nelin, 


Figure 8b Dispersion curves in the [111] 
direction in KBr at 90 K, after A. D. B, 
Woods, B. N. Brockhouse, R. A. Cowley, 
and W. Cochran. The extrapolation to K = 0 
of the TO, LO branches are called <o T , to, . 


The numerology of the branches follows from the number of degrees of free- 
dom of the atoms. With p atoms in the primitive cell and N primitive cells, there 
are pN atoms. Each atom has three degrees of freedom, one for each of the x, y, z 
directions, making a total of 3 pN degrees of freedom for the crystal. The number 
of allowed K values in a single branch is just N for one Brillouin zone. 1 Thus the 


*We show in Chapter 5 by application of periodic boundary conditions to the modes of the 
crystal of volume V that there is one K value in the volume (2rr) 3 /V in Fourier space. The volume of a 
Brillouin zone is (2rr Vy. where V e is the volume of a crystal primitive oell. Thus the number of 
allowed K values in a Brillouin zone is V/V,,, which is just N, the number of primitive cells in the crystal. 
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M 2 

[■« a H 

Figure 9 A diatomic crystal structure with masses M h ,W 2 connected by force constant C be- 
tween adjacent planes. The displacements of atoms M| are denoted by u s _ 1 , u„ u, +1 , • • . , and of 
atoms M 2 by c s _,, v„ o, +1 . The repeat distance is a in the direction of the wavevector K. The atoms 
are shown in their undisplaced positions. 


LA and the two TA branches have a total of 31V modes, thereby accounting for 3 N 
of the total degrees of freedom. The remaining (3 p - 3 )N degrees of freedom are 
accommodated by the optical branches. 

We consider a cubic crystal where atoms of mass Mj lie on one set of planes 
and atoms of mass M 2 lie on planes interleaved between those of the first set 
(Fig. 9). It is not essential that the masses be different, but either the force con- 
stants or the masses will be different if the two atoms of the basis are in non- 
equivalent sites. Let a denote the repeat distance of the lattice in the direction 
normal to the lattice planes considered. We treat waves that propagate in a 
symmetry direction such that a single plane contains only a single type of ion; 
such directions are [111] in the NaCl structure and [100] in the CsCl structure. 

We write the equations of motion under the assumption that each plane 
interacts only with its nearest-neighbor planes and that the force constants are 
identical between all pairs of nearest-neighbor planes. We refer to Fig. 9 to 
obtain 


(Ptt s 

M 1 — = C(v i + v,- 1 -2u s ) ; 
M 2 -^ = C(u ,+ 1 + u s - 2vJ . 


(18) 


We look for a solution in the form of a traveling wave, now with different 
amplitudes u, v on alternate planes: 

u s = u exp(isKo) exp(— itot) ; v 2 = v exp(isKn) exp(— ^ lent) . (19) 

We define a in Fig. 9 as the distance between nearest identical planes, not 
nearest-neighbor planes. 

On substitution of (19) in (18) we have 

— o) 2 M \U = Ce[l + exp(— zKa)] — 2 Cu ; 

—<i) 2 M 2 v = Cu[exp(iKa) + 1] — 2Ct> . 


( 20 ) 
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The homogeneous linear equations have a solution only if the determinant of . 
the coefficients of the unknowns u, v vanishes: 


or 


1C - M ]( o 2 
— C[1 + exp(tKa)] 


—C[ 1 + exp(iKa)] 
1C - M 2 w 2 


= 0 , 


( 21 ) 


M X M 2 w 4 - 2C(Mj + M 2 )m 2 + 2C 2 (1 - cos Ka) = 0 . (22) 

We can solve this equation exactly for or, but it is simpler to examine the 
limiting cases Ka 1 and Ka = ±tt at the zone boundary. For small Ka we 
have cos Ka = 1 — | K 2 a 2 + . . . , and the two roots are 


(optical branch) ; (23) 

(acoustical branch) . (24) 

The extent of the first Brillouin zone is r/a < K < ir/a t where a is the repeat 
distance of the lattice. At K, ma = ±Tr/a the roots are 

<n 2 = 1C/M 1 ; a> 2 — 1C/M 2 . (25) 

The dependence of on K is shown in Fig. 7 for > M 2 . 

The particle displacements in the transverse acoustical (TA) and trans- 
verse optical (TO) branches are shown in Fig. 10. For the optical branch at 
K = 0 we find, on substitution of (23) in (20), 


^ 2C| i + v- 
M 2 


Mi 


<o 2 = 


ic 


Mi + M 2 


2„2 


■K 2 a 


u _ _ M2 

0 Mi 


(26) 


The atoms vibrate against each other, but their center of mass is fixed. If the 
two atoms carry opposite charges, as in Fig. 10, we may excite a motion of this 



Optical mode 


K 


Figure 10 Transverse optical and 
transverse acoustical waves in a di- 
atomic linear lattice, illustrated by the 
particle displacements for the two 
inodes at the same wavelength. 
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type with the electric field of a light wave, so that the branch is called the opti- 
cal branch. At a general K the ratio u/v will be complex, as follows from either 
of the equations (20). Another solution for the amplitude ratio at small K is 
u = v, obtained as the K = 0 limit of (24). The atoms (and their center of 
mass) move together, as in long wavelength acoustical vibrations, whence the 
term acoustical branch. 

Wavelike solutions do not exist for certain frequencies, here between 
(2C/M 1 ) m and (2C/M 2 ) L “. This is a characteristic feature of elastic waves in 
polyatomic lattices. There is a frequency gap at the boundary K mai = ±ir/a of 
the first Brillouin zone. 

QUANTIZATON OF ELASTIC WAVES 

The energy of a lattice vibration is quantized. The quantum of energy is 
called a phonon in analogy with the photon of the electromagnetic wave. The 
energy of an elastic mode of angular frequency o> is 

e = (n + | )hw (27) 

when the mode is excited to quantum number n; that is, when the mode is occu- 
pied by n phonons. The term § hco is the zero point energy of the mode. It occurs 
for both phonons and photons as a consequence of their equivalence to a quan- 
tum harmonic oscillator of frequency o>, for which the energy eigenvalues are 
also (n + |)fttu. The quantum theory of phonons is developed in Appendix C. 

We can quantize the mean square phonon amplitude. Consider the stand- 
ing wave mode of amplitude 

u = u 0 cos Kx cos tut . 

Here u is the displacement of a volume element from its equilibrium position 
at x in the crystal. The energy in the mode, as in any harmonic oscillator, is half 
kinetic energy and half potential energy, when averaged over time. The kinetic 
energy density is \ p(du/dt) 2 , where p is the mass density. In a crystal of volume 
V, the volume integral of the kinetic energy is \ pVco 2 u\ sin 2 tut. The time aver- 
age kinetic energy is 

§pVtu 2 u 2 = | (n + \)hco , (28) 

because <sin 2 cot > = The square of the amplitude of the mode is 

«5 = 4(n + 1 )h/pVco . (29) 

This relates the displacement in a given mode to the phonon occupancy n of 
the mode. 

What is the sign of tu? The equations of motion such as (2) are equations 
for tu 2 , and if this is positive then tu can have either sign, + or — . But the 


energy of a phonon must be positive, so it is conventional and suitable to view 
a> as positive. If the crystal structure is unstable, then a> 2 will be negative and a> 
will be imaginary. 


PHONON MOMENTUM 

A phonon of wavevector K will interact with particles such as photons, 
neutrons, and electrons as if it had a momentum hK. However, a phonon does 
not carry physical momentum. 

The reason that phonons on a lattice do not carry momentum is that a 
phonon coordinate (except for K = 0) involves relative coordinates of the 
atoms. Thus in an H 2 molecule the internuclear vibrational coordinate r x - r 2 
is a relative coordinate and does not carry linear momentum; the center of 
mass coordinate s(r 2 4- r 2 ) corresponds to the uniform mode K = 0 and can 
carry linear momentum. 

In crystals there exist wavevector selection rules for allowed transitions 
between quantum states. We saw in Chapter 2 that the elastic scattering of an 
x-ray photon by a crystal is governed by the wavevector selection rule 

k' = k + G , (30) 

where G is a vector in the reciprocal lattice, k is the wavevector of the incident 
photon, and k' is the wavevector of the scattered photon. In the reflection 
process the crystal as a whole will recoil with momentum — hG, but this uni- 
form mode momentum is rarely considered explicitly. 

Equation (30) is an example of the rule that the total wavevector of inter- 
acting waves is conserved in a periodic lattice, with the possible addition of a 
reciprocal lattice vector G. The true momentum of the whole system always is 
rigorously conserved. If the scattering of the photon is inelastic, with the 
creation of a phonon of wavevector K, then the wavevector selection rule 
becomes 


k' + K = k + G. (31) 

If a phonon K is absorbed in the process, we have instead the relation 

k' = k + K + G . (32) 

Relations (31) and (32) are the natural extensions of (30). 


INELASTIC SCATTERING BY PHONONS 

Phonon dispersion relations o»(K) are most often determined experimen- 
tally by the inelastic scattering of neutrons with the emission or absorption of a 
phonon. A neutron sees the crystal lattice chiefly by interaction with the nuclei 
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of the atoms. The kinematics of the scattering of a neutron beam by a crystal 
lattice are described by the general wavevector selection rule: 

k + G = k' ± K , (33) 

and by the requirement of conservation of energy. Here K is the wavevector of 
the phonon created ( + ) or absorbed ( — ) in the scattering process, and G is 
any reciprocal lattice vector. For a phonon we choose G such that K lies in the 
first Brillouin zone. 



Wavevector, in units 2 tr/a 

Figure 11 The dispersion curves of sodium for phonons propagating in the [001], [110], and 
[111] directions at 90 K, as determined by inelastic scattering of neutrons, by Woods, Brockhouse, 
March and Bowers. 



Figure 12 A triple axis neutron spectrometer at Brookhaven. (Courtesy of B. II. Grier.) 
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The kinetic energy of the incident neutron is p' 2 /2M„, where M„ is the mass 
of the neutron. The momentum p is given by ftk, where k is the wavevector of 
the neutron. Thus h 2 k 2 /2M„ is the kinetic energy of the incident neutron. If k' 
is the wavevector of the scattered neutron, the energy of the scattered neutron 
is h 2 k’ 2 /2M n . The statement of conservation of energy is 


h 2 k 2 
2 M n 


h 2 k ' 2 
2 ■ 


: hta 


(34) 


where ha> is the energy of the phonon created ( + ) or absorbed ( — ) in the 
process. 

To determine the dispersion relation using (33) and (34) it is necessary in 
the experiment to find the energy gain or loss of the scattered neutrons as a 
function of the scattering direction k - k'. Results for germanium and KBr are 
given in Fig. 8; results for sodium are given in Fig. II. A spectrometer used for 
phonon studies is shown in Fig. 12. 


SUMMARY 

• The quantum unit of a crystal vibration is a phonon. If the angular fre- 
quency is (o, the energy of the phonon is h<o. 

• When a phonon of wavevector K is created by the inelastic scattering of a 
photon or neutron from wavevector k to k', the wavevector selection rule that 
governs the process is 

k = k' + K + G , 

where G is a reciprocal lattice vector. 

• All elastic waves can be described by wavevectors that lie within the first 
Brillouin zone in reciprocal space. 

• If there are p atoms in the primitive cell, the phonon dispersion relation will 
have 3 acoustical phonon branches and 3p — 3 optical phonon branches. 

Problems 

1. Monatomic linear lattice. Consider a longitudinal wave 

u s — u eos{taf — sKa) 

which propagates in a monatomic linear lattice of atoms of mass M, spacing a, and 
nearest-neighbor interaction C. 

(a) Show that the total energy of the wave is 

E = \M 2 ( dujdt f + I C 2 (« s - « s +i) 2 • 


where s runs over all atoms. 
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(b) By substitution of u s in this expression, show that the time-average total energy 
per atom is 

| MA 2 + 1 C(1 - cos Ka)u 2 = f M«V , 
where in the last step we have used the dispersion relation (9) for this problem, 

2. Continuum wave equation. Show that for long wavelengths the equation of mo- 
tion (2) reduces to the continuum elastic wave equation 

d 2 u _ 2 d 2 U 
dt 2 V fix 2 ’ 

where v is the velocity of sound. 

3. Basis of two unlike atoms. For the problem treated by (18) to (26), find the am- 
plitude ratios u/v for the two branches at K lrl „ = ir/a. Show that at this value of K 
the two lattices act as if decoupled: one lattice remains at rest while the other lat- 
tice moves. 

4. Kohn anomaly. We suppose that the interplanar force constant C p between planes 
s and s + p is of the form 

sin pk 0 a 

C p =A pa ■ 

where A and k n are constants and p runs over all integers. Such a form is expected in 
metals. Use this and Eq. (16a) to find an expression for a> 2 and also for dw 2 /dK. Prove 
that do) 2 /dK is infinite when K = k„. Thus a plot of o> 2 versus K or of to versus K has a 
vertical tangent at k„: there is a kink at fc 0 in the phonon dispersion relation a>(K). 

5. Diatomic chain. Consider the normal modes of a linear chain in which the force 
constants between nearest-neighbor atoms are alternately C and IOC. Let the 
masses be equal, and let the nearest-neighbor separation be a/2. Find o>(K ) at 
K = 0 and K = n/a. Sketch in the dispersion relation by eye. This problem simu- 
lates a crystal of diatomic molecules such as H 2 . 

6. Atomic vibrations in a metal. Consider point ions of mass M and charge e im- 
mersed in a uniform sea of conduction electrons. The ions are imagined to be in 
stable equilibrium when at regular lattice points. If one ion is displaced a small dis- 
tance r from its equilibrium position, the restoring force is largely due to the elec- 
tric charge within the sphere of radius r centered at the equilibrium position. Take 
the number density of ions (or of conduction electrons) as 3/4 irR 3 , which defines R. 

(a) Show that the frequency of a single ion set into oscillation is w = (e 2 /MR 3 ) yi . 

(b) Estimate the value of this frequency for sodium, roughly, (c) From (a), (b), and 
some common sense, estimate the order of magnitude of the velocity of sound in 
the metal. 

*7. Soft phonon modes. Consider a fine of ions of equal mass but alternating in 
charge, with e p = e{ — l) p as the charge on the pth ion. The interatomic potential is 


’This problem is rather difficult. 


the sum of two contributions: (1) a short-range interaction of force constant ( ' 1K = y 
that acts between nearest neighbors only, and (2) a coulomb interaction between 
all ions, (a) Show that the contribution of the coulomb interaction to the atomic 
force constants is c P c — 2( — l) p e 2 /p 3 a 3 , where a is the equilibrium nearest-neigh- 
bor distance, (b) From (16a) show that the dispersion relation may be written as 

w 2 /ct*o — sin 2 ^Ka + <r2( — 1 ) p (1 - cos pKa)p~ 3 , 

p - 1 

where (ol 52 4y/M and <r = e 2 /ya 3 . (c) Show that at 2 is negative (unstable mode) at 
the zone boundary Ka = -tr if cr > 0.475 or 4/7£(3), where f is a Riemann zeta func- 
tion, Show further that the speed of sound at small Ka is imaginary if <r > (2 In 2) _I 
= 0,721. Thus <u 2 goes to zero and the lattice is unstable for some value of Ka in 
the interval (0, 7 r) if 0.475 < cr < 0.721. Notice that the phonon spectrum is not 
that of a diatomic lattice because the interaction of any ion with its neighbors is the 
same as that of any other ion. 



5 

Phonons II. Thermal Properties 


PHONON HEAT CAPACITY 107 

Planck distribution 107 

Norma! mode enumeration 108 

Density of states in one dimension 108 

Density of states in three dimensions 111 

Debye model for density of states 1 12 

Debye T 3 law 114 

Einstein model of the density of states 1 14 

General result for D(a>) 117 


ANHARMONIC CRYSTAL INTERACTIONS 
Thermal expansion 


119 

120 


THERMAL CONDUCTIVITY 

Thermal resistivity of phonon gas 
Umklapp processes 
Imperfections 


121 

123 

125 

126 


PROBLEMS 


128 


1. Singularity in density of states 128 

2. Rms thermal dilation of crystal cell 128 

3. Zero point lattice displacement and strain 128 

4. Heat capacity of layer lattice 128 

5. Griineisen constant 129 




CHAPTER 5: PHONONS II. THERMAL PROPERTIES 


We discuss the heat capacity of a phonon gas and then the effects of 
anharmonic lattice interactions on the phonons and on the crystal. 

PHONON HEAT CAPACITY 

By heat capacity we shall usually mean the heat capacity at constant vol- 
ume, which is more fundamental than the heat capacity at constant pressure, 
which is what the experiments determine. 1 The heat capacity at constant vol- 
ume is defined as C v = (dU/dT) v where U is the energy and T the temperature. 

The contribution of the phonons to the heat capacity of a crystal is called 
the lattice heat capacity and is denoted by Ci at . The total energy of the 
phonons at a temperature r(= k v T) in a crystal may be written as the sum of 
the energies over all phonon modes, here indexed by the wavevector K and 
polarization index p: 

tfiat = 22 ^ = 22 {n K )hM^ , (!) 

K p K p 

where (n Kp ) is the thermal equilibrium occupancy of phonons of wavevector 
K and polarization p. The form of (n^,) is given by the Planck distribution 
function: 


(n) = 


1 

exp (Am/t) — 1 


( 2 ) 


where the {• ■ ■) denotes the average in thermal equilibrium. A graph of (n) is 
given in Fig. 1. 

Planck Distribution 

Consider a set of identical harmonic oscillators in thermal equilibrium. 
The ratio of the number of oscillators in their (n + l)th quantum state of exci- 
tation to the number in the nth quantum state is 


N n+1 /N n = exp(-hw/T) , r=k B T , 


( 3 ) 


'A thermodynamic relation gives C p — C v = 9<r 2 BVT, where a is the temperature coefficient 
of linear expansion, V the volume, and B the bulk modulus. The fractional difference between C p 
and C v is visually small in solids and often may be neglected. As T— * 0 we see that C p —* C v , pro- 
vided a and B are constant. 
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by use of the Boltzmann factor. Thus the fraction of the total number of oscil- 
lators in the nth quantum state is 

N„ exp(— nftw/r) 

-= — = -= • (4) 

^jN s ^ exp (-shw/r) 

s = 0 5=0 

We see that the average excitation quantum number of an oscillator is 

2s exp (—shco/r) 

(n) = “b ; • (5) 

2j exp( —shcj/r) 

S 

The summations in (5) are 




, i-*' , d-r) ! ’ 

with x = exp(— hw/r). Thus we may rewrite (5) as the Planck distribution: 


1 — x exp (ftw/r) — 1 

Normal Mode Enumeration 

The energy of a collection of oscillators of frequencies o) Kp in thermal 
equilibrium is found from (1) and (2): 

= \ t ■ (8) 

k ,, exp {na> Kj /T) - 1 

It is usually convenient to replace the summation over K by an integral. Sup- 
pose that the crystal has D p (w)d(o modes of a given polarization p in the fre- 
quency range <u to a> + da>. Then the energy is 


. (9) 

The lattice heat capacity is found by differentiation with respect to tempera- 
ture. Let x = hto/r = tua/k B T: then dU/dT gives 

„ f x 2 exp x 

= k B 2 J do> D p (u)j-—?-^ . (10) 

The central problem is to find D(w), the number of modes per unit fre- 
quency range. This function is called the density of modes or, more often, den- 
sity of states. 

Density of States in One Dimension 

Consider the boundary value problem for vibrations of a one-dimensional 
line (Fig. 2) of length L carrying N + 1 particles at separation a. We suppose 


c to = j da)D p (a))- 
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s' Fixed 


s=0 1 2 


s = 10 


Figure 2 Elastic line of jV + 1 atoms, with N — 10, for boundary conditions that the end atoms 
s = 0 and s = 10 are fixed. The particle displacements in the normal modes for either longitudinal 
or transverse displacements are of the form u, sin sKa. This form is automatically zero at the 
atom at the end s = 0, and we choose K to make die displacement zero at the end s = 10 . 


-• a • • •- « * • ♦ — -- 

0 _7r_ 2tt IQrr 

J0a lOn "" 10a 

K - 

Figure 3 The boundary condition sin sKa — 0 for s — 10 can be satisfied by choosing K = 7r/l 0c, 
2rr/l 0a, . . ■ I .t/ 1 i)a . where 10a is the length L of the line. The present figure is in K space. The 

dots are not atoms but are the allowed values of K. Of the N + 1 particles on the line, only N — 1 
are allowed to move, and their most general motion can be expressed in terms of the N — 1 al- 
lowed values of K. This quantization of K has nothing to do with quantum mechanics but follows 
classically from the boundary conditions that the end atoms be fixed. 


that the particles s = 0 and s = N at the ends of the line are held fixed. Each 
normal vibrational mode of polarization p has the form of a standing wave, 
where u s is the displacement of the particle s: 

u s — m( 0) exp{ — i(o Kp t ) sin sKii , (11) 

where o) K]> is related to K by the appropriate dispersion relation. 

As in Fig. 3, the waveveetor K is restricted by the fixed-end boundary con- 
ditions to the values 


k= tt 2i t 3t t (N ~ 1)^ 

K L ’ L ’ L L 

The solution for K = tt/L has 

u s sin ( sira/L ) 


( 12 ) 


(13) 


and vanishes for s = 0 and s — N as required. 

The solution for K = Ntt/L — ir/a = K,,,,,, has u, cjr - sin stt-, this permits no 
motion of any atom, because sin sir vanishes at each atom. Thus there are 
N — 1 allowed independent values of K in (12). This number is equal to the 
number of particles allowed to move. Each allowed value of K is associated 
with a standing wave. For the one-dimensional line there is one mode for each 
interval A K — tt/L, so that the number of modes per unit range of K is L/tt for 
K ^ -rr/a, and 0 for K > Trla. 

There are three polarizations p for each value of K: in one dimension two 
of these are transverse and one longitudinal. In three dimensions the polariza- 
tions are this simple only for wavevectors in certain special crystal directions. 

Another device for enumerating modes is equally valid. We consider the 
medium as unbounded, but require that the solutions be periodic over a large 
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Figure 4 Consider N particles constrained to slide on 
a circular ring. The particles can oscillate if connected 
by elastic springs. In a normal mode the displacement 
u s of atom s will be of the form sin sKa or cos sKa. 
these are independent modes. By the geometrical pe- 
riodicity of the ring the boundary condition is that 
u v t , = u s for all s, so that NKa must be an integral 
multiple of 2 77 For N = 8 the allowed independent 
values of K are 0, 2ir/8a, 4ir/Ha, 6-jr/Sa, and 8w/8 a. The 
value K = 0 is meaningless for the sine form, because 
sin sOa = 0. The value 8ir/8a has a meaning only for 
the cosine form, because sin (s8ira/8a) = sins-rr = 0. 
The three other values of K are allowed for both the 
sine and cosine modes, giving a total of eight allowed 
modes for the eight particles. Thus the periodic 
boundary condition leads to one allowed mode per 
particle, exactly as for the fixed-end boundary condi- 
tion of Fig. 3. If we had taken the modes in the com- 
plex form exp(isKa), the periodic boundary condition 
would lead to the eight modes with K = 0, ±2ir/Na, 
±4ir/Na , ±6ir/Ma, and SroWa, as in Eq. (14). 



_Njr _6rr_4ir_2ir 0 2tt 4 tt 6tt Ntt 

L L L L L L L L 

K *- 

Figure 5 Allowed values of wavevector K for periodic boundary conditions applied to a linear lat- 
tice of periodicity N = 8 atoms on a line of length L. The K = 0 solution is the uniform mode. The 
special points ±Ntt/L represent only a single solution because exp(irr.v) is identical to exp(-iTrs); 
thus there are eight allowed modes, with displacements of the .sth atom proportional to 1, 
exp(±i-irs/4), exp(±irrs/2), exp( ±s3irs/4), exp(iirs). 


distance L, so that u(sa) — u(sa + L). The method of periodic boundary 
conditions (Figs. 4 and 5) does not change the physics of the problem in any 
essential respect for a large system. In the running wave solution 
u s = u(0) exp[/(sKfl — tu K f)] the allowed values of K are 


K= 0 , ± 


2 TT 

L 


4-77 

L 


677 

L 


Ntt 

L 


( 14 ) 


This method of enumeration gives the same number of modes (one per 
mobile atom) as given by (12), but we have now both plus and minus values of 
K, with the interval A K = 2ir/L between successive values of K. For periodic 
boundary conditions, the number of modes per unit range of K is L/2tt for 
— 77/0 ^ K < 77/u, and 0 otherwise. The situation in a two-dimensional lattice is 
portrayed in Fig. 6. 

We need to know D((o), the number of modes per unit frequency range for 
a given polarization. The number of modes D(a > ) du> in da> at a> is given in one 
dimension by 




( 15 ) 
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Figure 6 Allowed values in Fourier space of the phonon wavevector K for a square lattice of lat- 
tice constant a, with periodic boundary conditions applied over a square of side L = 10a. The uni- 
form mode is marked with a cross. There is one allowed value of K per area (2rr/10 a) 2 = (2ir/L) 2 , so 
that within the circle of area nK 2 the smoothed number of allowed points is ttK 2 (L/ 2v) 2 . 


We can obtain the group velocity da>/dK from the dispersion relation co versus 
K. There is a singularity in D/ojjwhenever the dispersion relation w(K) is hori- 
zontal; that is, whenever the group velocity is zero. 

Density of States in Three Dimensions 

We apply periodic boundary conditions over N 3 primitive cells within a 
cube of side L, so that K is determined by the condition 

expUCK,* + K# + K,z )] = exp{i[fC r (x + L) + Kjy + L) + iC(z + L)]} , (16) 

whence 


K x ,K y ,K z = 0 ; 


+ 2tt 4tt 
~ L ’ " L ’ 


Ntt 

L 


(17) 


Therefore, there is one allowed value of K per volume (2n/L) 3 in K space, or 



(18) 


allowed values of K per unit volume of K space, for each polarization and for 
each branch. The volume of the specimen is V = L 3 . 

The total number of modes with wavevector less than K is found from (18) 
to be ( L/2-77 ) 3 times the volume of a sphere of radius K. Thus 


N = (L/2tt) 3 (4itK73) 


(19) 
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for each polarization type. The density of states for each polarization is 

D(cu) = dN/dw = (V&/2n i )(dK/da>) . (20) 


Debye Model for Density of State s 

In the Debye approximation the velocity of sound is taken as constant for 
each polarization type, as it would be for a classical elastic continuum. The dis- 
persion relation is written as 

(o =vK , (21) 

with v the constant velocity of sound. 

The density of states (20) becomes 

DM = V«72ttV . (22) 

If there are N primitive cells in the specimen, the total number of acoustic 
phonon modes is N. A cutoff frequency <o D is determined by (19) as 

a>l = 6tt VN/V . (23) 

To this frequency there corresponds a cutoff waveveetor in K space: 

K d = co D /v = (6 ttN/V) V3 . (24) 

On the Debye model we do not allow modes of waveveetor larger than K u . The 
number of modes with K^K D exhausts the number of degrees of freedom of a 
monatomic lattice. 

The thermal energy (9) is given by 

U = Jdw D(<o)(n(w))ha> = ^ " doj J - < 2 5 ) 


for each polarization type. For brevity we assume that the phonon velocity is 
independent of the polarization, so that we multiply by the factor 3 to obtain 


U = 


3VA 

2ir z v 3 Jo 


do) 


0) 


hailT 


- 1 


3Vk 4 B P f* x 3 

2^v 3 h 3 h <?- 1 1 


(26) 


where x — h(o/r 35 ha>/k B T and 

x D = h(o D /k B T = d/T . 


(27) 


This defines the Debye temperature 6 in terms of <w D defined by (23). 
We may express 0 as 


_ fw ( eMy Y ' 3 
k B '\ V ) 


(28) 


m 
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Figure 7 Heat capacity C v of a solid, according to 
the Debye approximation. The vertical scale is in J 
mol -1 K _l . The horizontal scale is the temperature 
normalized to the Debye temperature 6. The re- 
gion of the T 3 law is below 0.10, The asymptotic 
value at high values of T/0 is 24.943 J mol -1 deg -1 . 



Figure 8 Heat capacity of silicon and germa- 
nium. Note the decrease at low temperatures. 
To convert a value in cal/mol-K to J/mol-K, 
multiply by 4.186. 


so that the total phonon energy is 


u - 9Nt 4«JC‘ lx A • m 

where N is the number of atoms in the specimen and x D = 6/T . 

The heat capacity is found most easily by differentiating the middle ex- 
pression of (26) with respect to temperature. Then 


r 3Vh 2 

v 2rrVfc B T 2 



to 4 e* mh 


{e hwh - l) 2 



x 4 e* 

(e* - l) 2 ' 


(30) 


The Debye heat capacity is plotted in Fig. 7. At T > 0 the heat capacity ap- 
proaches the classical value of 3 Nk B . Measured values for silicon and germa- 
nium are plotted in Fig. 8. 




Debye l :i Law 

At very low temperatures we may approximate (29) by letting the upper 
limit go to infinity. We have 

I dx - _ = 1 dxx 2, 2 exp(-sx) = 6^4 = tp , (31) 

^0 * 1 -^0 s=l 1 S iO 

where the sum overs -4 is found in standard tables. Thus U = 37r 4 A 7 /c s 7'Vo0 3 for 
T<8, and 

C v = ~~ Nk B (^J = 234 Nk B (^J , (32) 

which is the Debye T 3 approximation. Experimental results for argon are plot- 
ted in Fig. 9. 

At sufficiently low temperature the T 3 approximation is quite good; that is, 
when only long wavelength acoustic modes are thermally excited. These are just 
the modes that may be treated as an elastic continuum with macroscopic elastic 
constants. The energy of the short wavelength modes (for which this approxima- 
tion fails) is too high for them to be populated significantly at low temperatures. 

We understand the T 3 result by a simple argument (Fig. 10). Only those 
lattice modes having fm < k B T will be excited to any appreciable extent at a 
low temperature T. The excitation of these modes will be approximately classi- 
cal, each with an energy close to k B T, according to Fig. 1. 

Of the allowed volume in K space, the fraction occupied by the excited 
modes is of the order of {<D 1 Ju> D ) i or (K T /K D ) 3 , where K T is a “thermal” wavevec- 
tor defined such that fivK r = k B T and K n is the Debye cutoff wavevector. Thus 
the fraction occupied is (T/0) 3 of the total volume in K space. There are of the 
order of 3Af(T/0) 3 excited modes, each having energy k B T. The energy is 
~ -3Nk B T(T/8 ) 3 , and the heat capacity is ~12 Nk B (T/8) 3 . 

For actual crystals the temperatures at which the T 3 approximation holds 
are quite low. It may be necessary to be below T = 0/50 to get reasonably pure 
T 3 behavior. 

Selected values of 8 are given in Table 1. Note, for example, in the alkali 
metals that the heavier atoms have the lowest 0’s, because the velocity of 
sound decreases as the density increases. 

Einstein Model of the Density of States 

Consider N oscillators of the same frequency <« 0 and in one dimension. 
The Einstein density of states is D(o)) = N8(io — w 0 ), where the delta function 
is centered at <u 0 . The thermal energy of the system is 

U = N(n)h „ = J^- > (33) 

with o) now written in place of <n 0 , for convenience. 
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Figure 9 Low temperature heat capacity of solid argon, plotted against T 3 . In this temperature 
region the experimental results are in excellent agreement with the Debye T 3 law with 8 = 92.0 K. 
(Courtesy of L. Finegold and N. E. Phillips.) 


K 


y 



Figure 10 To obtain a qualitative explanation of the Debye T 3 law, we suppose that all phonon 
modes of wavevector less than K r have the classical thermal energy k„T and that modes between 
K r and the Debye cutoff K D are not excited at all. Of the 3 N possible modes, the fraction excited is 
(K t /K d ) 3 = ( T/8 ) 3 , because this is the ratio of the volume of the inner sphere to the outer sphere. 
The energy is U = k B T - 3 N(T/8) 3 , and the heat capacity is C v = dU/ST = 12 Nk K (T/8) 3 . 
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Mg 

400 

1.56 


327 

1.16 


Hf 

252 

0.23 


Ta 

240 

0.58 


Re 

430 

0.48 


Au 

165 

3.17 


Hg 

71.9 


Pb 

105 

0.35 


Os 

500 

0.88 


Table 1 Debye temperature and thermal conductivity 


158 

1.41 


92 


645 

1.48 


230 


360. 

0.16 


420 

0.22 


380 

0.31 


445 

1.00 


450 

0.91 


343 

4.01 


320 

0.41 


374 

0.60 


282 

0.50 


90 

0.02 


209 

0.97 


108 

0.82 


200 

0.67 


211 

0.24 


153 

0.02 


La /? 


142 

0.14 


400 

1.74 


38 

0.36 


110 


78.5 

0.46 


Low temperature limit of 0,, in Kelvin 
Thermal conductivity at 300 K, in W cm~'K~' 


0.11 0.12 I 0.16 


0.11 0.11 0.11 0.16 0.14 0.17 0.35 1 0.16 


207 I I 
0.28 I 0.06 I 0.07 
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Figure 1 1 Comparison of experimental values of the heat capacity of diamond with values calcu- 
lated on the earliest quantum (Einstein) model, using the characteristic temperature 
0 E = fw>/k B — 1320 K. To convert to J/mol-deg, multiply by 4.186. 


The heat capacity of the oscillators is 


Cy = 



fcd V e WT 

T J ( g fe»/T _ Yf ’ 


(34) 


as plotted in Fig. 11. This expresses the Einstein (1907) result for the contribu- 
tion of N identical oscillators to the heat capacity of a solid. In three dimensions 
N is replaced by 3JV, there being three modes per oscillator. The high tempera- 
ture limit of C v becomes 3 Nk B , which is known as the Dulong and Petit value. 

At low temperatures (34) decreases as exp(— hco/r), whereas the experi- 
mental form of the phonon contribution is known to be T 3 as accounted for by 
the Debye model treated above. The Einstein model, however, is often used to 
approximate the optical phonon part of the phonon spectrum. 


General Result for D(a>) 

We want to find a general expression for D(a>), the number of states per unit 
frequency range, given the phonon dispersion relation «(K). The number of al- 
lowed values of K for which the phonon frequency is between u> and co + do> is 


D{w) do> = 



(35) 


where the integral is extended over the volume of the shell in K space bounded 
by the two surfaces on which the phonon frequency is constant, one surface on 
which the frequency is w and the other on which the frequency is <a + dw. 

The real problem is to evaluate the volume of this shell. We let dS (l , denote 
an element of area (Fig. 12) on the surface in K space of the selected constant 
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K- 


Figure 12 Element of area dS„ on a constant 
frequency surface in K space. The volume 
between two surfaces of constant frequency at 
&) and ft) + d(o is equal to f dS w dft)/|V K ft> |. 



frequency co. The element of volume between the constant frequency surfaces 
co and co + dco is a right cylinder of base dS w and altitude dK L , so that 

I d?K — I dSJK ± , (36) 

J shell J 


Here dK ± is the perpendicular distance (Fig. 13) between the surface co con- 
stant and the surface co + dco constant. The value of dK ± will vary from one 
point to another on the surface. 

The gradient of co, which is V K w, is also normal to the surface co constant, 
and the quantity 

|V K o>| dK L = dco , 


is the difference in frequency between the two surfaces connected by dK ± . 
Thus the element of the volume is 


where v g 
we have 


dS 01 dK x = dS 0) ^- = dS IJ1 ^- 


|V K o)| is the magnitude of the group velocity of a phonon. For (35) 


D(co) dco = 




We divide both sides by dco and write V = L 3 for the volume of the crystal: the 
result for the density of states is 
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Surface co + dio = constant 



Figure 14 Density of states as a function of frequency for (a) the Debye solid and (b) an actual 
crystal structure. The spectrum for the crystal starts as a) 2 for small but discontinuities develop 
at singular points. 


The integral is taken over the area of the surface oi constant, in K space. The 
result refers to a single branch of the dispersion relation. We can use this re- 
sult also in electron band theory. 

There is a special interest in the contribution to D(a>) from points at which 
the group velocity is zero. Such critical points produce singularities (known as 
Van Hove singularities) in the distribution function (Fig. 14). 


ANHARMONIC CRYSTAL INTERACTIONS 

The theory of lattice vibrations discussed thus far has been limited in the 
potential energy to terms quadratic in the interatomic displacements. This is 
the harmonic theory; among its consequences are: 

• Two lattice waves do not interact; a single wave does not decay or change 
form with time. 

• There is no thermal expansion. 

• Adiabatic and isothermal elastic constants are equal. 

• The elastic constants are independent of pressure and temperature. 

• The heat capacity becomes constant at high temperatures T> 6. 
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In real crystals none of these consequences is satisfied accurately. The devia- 
tions may be attributed to the neglect of anharmonic (higher than quadratic) 
terms in the interatomic displacements. We discuss some of the simpler as- 
pects of anharmonic effects. 

Beautiful demonstrations of anharmonic effects are the experiments on 
the interaction of two phonons to produce a third phonon at a frequency 
<n 3 = nq + ai 2 . Three-phonon processes are caused by third-order terms in the 
lattice potential energy. The physics of the phonon interaction can be stated 
simply: the presence of one phonon causes a periodic elastic strain which 
(through the anharmonic interaction) modulates in space and time the elastic 
constant of the crystal. A second phonon perceives the modulation of the elas- 
tic constant and thereupon is scattered to produce a third phonon, just as from 
a moving three-dimensional grating. 

Thermal Expansion 

We may understand thermal expansion by considering for a classical oscil- 
lator the effect of anharmonic terms in the potential energy on the mean sepa- 
ration of a pair of atoms at a temperature T. We take the potential energy of the 
atoms at a displacement x from their equilibrium separation at absolute zero as 

U(x) — cx 2 — gr 3 — fx 4 , (38) 

with c, g, and/ all positive. The term in x 3 represents the asymmetry of the 
mutual repulsion of the atoms and the term in .r 4 represents the softening of the 
vibration at large amplitudes. The minimum at x = 0 is not an absolute mini- 
mum, but for small oscillations the form is an adequate representation of an in- 
teratomic potential. 

We calculate the average displacement by using the Boltzmann distribu- 
tion function, which weights the possible values of x according to their 
thermodynamic probability: 

I dxx exp[-/3I/(x)] 

<*> = ~ , 

I dx exp[-j3I/(x)] 

J — 30 

with j3 — l/k B T. For displacements such that the anharmonic terms in the 
energy are small in comparison with k B T, we may expand the integrands as 

/ dx x exp(-/3I7) = f dx [exp(-/3cx 2 )](x + /3gx 4 + pfx 5 ) = (37r 1/2 /4)(g/c 5/2 )j3 _3/2 ; 

/ dx exp(— /3t7) = J dx exp (— /3cr 2 ) = (tt y/c) 172 , (39) 

whence the thermal expansion is 

3g, 


(40) 
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Temperature, in K 


Figure 15 Lattice constant of solid argon as a 
function of temperature. 


in the classical region. Note that in (39) we have left cx 2 in the exponential, but 

we have expanded exp(/3gx 3 + /3 fx 4 ) = 1 + jSgx 3 + fifx 4 -t 

Measurements of the lattice constant of solid argon are shown in Fig. 15. 
The slope of the curve is proportional to the thermal expansion coefficient. 
The expansion coefficient vanishes as T — * 0, as we expect from Problem 5. In 
lowest order the thermal expansion does not involve the symmetric term fx 4 in 
U(x), but only the antisymmetric term gx : \ 


THERMAL CONDUCTIVITY 

The thermal conductivity coefficient K of a solid is defined with respect to 
the steady-state flow of heat down a long rod with a temperature gradient 
dT/dx: 

(41) 

where j v is the flux of thermal energy, or the energy transmitted across unit 
area per unit time. 

This form implies that the process of thermal energy transfer is a random 
process. The energy does not simply enter one end of the specimen and pro- 
ceed directly (ballistically) in a straight path to the other end, but diffuses 
through the specimen, suffering frequent collisions. If the energy were propa- 
gated directly through the specimen without deflection, then the expression 
for the thermal flux would not depend on the temperature gradient, but only 
on the difference in temperature A T between the ends of the specimen, re- 
gardless of the length of the specimen. The random nature of the conductivity 
process brings the temperature gradient and, as we shall see, a mean free path 
into the expression for the thermal flux. 
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Table 2 Phonon mean free paths 

[Calculated from (44), taking v = 5 X 10 6 cm/sec as a representative sound velocity. 
The €'s obtained in this way refer to umklapp processes.] 


Crystal 

T, °C 

C, in J ciii - ' 1 K -1 

K, in W cm -1 r' 

f, in A 

Quart/.* 

0 

2.00 

0.13 

40 


-190 

0.55 

0.50 

540 

NaCl 

0 

1.88 

0.07 

23 


-190 

1.00 

0.27 

100 


a Parallel to optic axis. 


From the kinetic theory of gases we find below the following expression 
for the thermal conductivity: 

K = fCcf , (42) 

where C is the heat capacity per unit volume, t> is tire average particle velocity, 
and i is the mean free path of a particle between collisions. This result was ap- 
plied first by Debye to describe thermal conductivity in dielectric solids, with C 
as the heat capacity of the phonons, v the phonon velocity, and (■ the phonon 
mean free path. Several representative values of the mean free path are given 
in Table 2. 

We give the elementary kinetic theory which leads to (42). The flux of par- 
ticles in the x direction is fn{|tv|), where n is the concentration of molecules; 
in equilibrium there is a flux of equal magnitude in the opposite direction. The 
{■ ■ ■) denote average value. 

If c is the heat capacity of a particle, then in moving from a region at local 
temperature T + AT to a region at local temperature T a particle will give up 
energy c AT. Now AT between the ends of a free path of the particle is given by 


A T = 


d,T „ _ dr 

dx x dx 


0 t T 


where r is the average time between collisions. 

The net flux of energy (from both senses of the particle flux) is therefore 

j v = -n(v 2 )cT^= -~n(v 2 )cT^ . (43) 

If, as for phonons, v is constant, we may write (43) as 

<«> 

with € = vt and C = nr. Thus K — iCvt. 
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Thermal Resistivity of Phonon Gas 

The phonon mean free path € is determined principally by two processes, 
geometrical scattering and scattering by other phonons. If the forces between 
atoms were purely harmonic, there would be no mechanism for collisions be- 
tween different phonons, and the mean free path would be limited solely by 
collisions of a phonon with the crystal boundary, and by lattice imperfections. 
There are situations where these effects are dominant. 

With anharmonic lattice interactions, there is a coupling between differ- 
ent phonons which limits the value of the mean free path. The exact states of 
the anharmonic system are no longer like pure phonons. 

The theory of the effect of anharmonic coupling on thermal resistivity' pre- 
dicts that € is proportional to I IT at high temperatures, in agreement with 
many experiments. We can understand this dependence in terms of the num- 
ber of phonons with which a given phonon can interact: at high temperature 
the total number of excited phonons is proportional to T. The collision fre- 
quency of a given phonon should be proportional to the number of phonons 
with which it can collide, whence (. 1/T. 

To define a thermal conductivity there must exist mechanisms in the crys- 
tal whereby the distribution of phonons may be brought locally into thermal 
equilibrium. Without such mechanisms we may not speak of the phonons at 
one end of the crystal as being in thermal equilibrium at a temperature T 2 and 
those at the other end in equilibrium at L\. 

It is not sufficient to have only a way of limiting the mean free path, but 
there must also be a way of establishing a local thermal equilibrium distribu- 
tion of phonons. Phonon collisions with a static imperfection or a crystal 
boundary will not by themselves establish thermal equilibrium, because such 
collisions do not change the energy of individual phonons: the frequency co 2 of 
the scattered phonon is equal to the frequency n>, of the incident phonon. 

It is rather remarkable also that a three-phonon collision process 

K x + K 2 = K 3 (45) 

will not establish equilibrium, but for a subtle reason: the total momentum of 
the phonon gas is not changed by such a collision. An equilibrium distribution 
of phonons at a temperature T can move down the crystal with a drift velocity 
which is not disturbed by three-phonon collisions of the form (45). For such 
collisions the phonon momentum 

J = £n K ftK (46) 

K 

is conserved, because on collision the change in J is K 3 — K 2 — K x = 0. Here n K 
is the number of phonons having wavevector K. 

For a distribution with J A 0, collisions such as (45) are incapable of es- 
tablishing complete thermal equilibrium because they leave J unchanged. If 


Figure 16a Flow of gas molecules in a state of drifting equilibrium down a long open tube with 
frictionless walls. Elastic collision processes among the gas molecules do not change the momen- 
tum or energy flux of the gas because in each collision the velocity of the center of mass of the col- 
liding particles and their energy remain unchanged. Thus energy is transported from left to right 
without being driven by a temperature gradient. Therefore the thermal resistivity is zero and the 
thermal conductivity is infinite. 



Figure 16b The usual definition of thermal conductivity in a gas refers to a situation where no 
mass flow is permitted. Here the tube is closed at both ends, preventing the escape or entrance of 
molecules. With a temperature gradient the colliding pairs with above-average center of mass ve- 
locities will tend to be directed to the right, those with below-average velocities will tend to be di- 
rected to the left. A slight concentration gradient, high on the right, will be set up to enable the 
net mass transport to be zero while allowing a net energy transport from the hot to the cold end. 



Figure 16c In a crystal we may arrange to create phonons chiefly at one end, as by illuminating 
the left end with a lamp. From that end there will be a net flux of phonons toward the right end of 
the crystal. If only N processes (Kj + K 2 = K 3 ) occur, the phonon flux is unchanged in momentum 
on collision and some phonon flux will persist down the length of the crystal. On arrival of 
phonons at the right end we can arrange in principle to convert most of their energy to radiation, 
thereby creating a sink for the phonons. Just as in (a) the thermal resistivity is zero. 


we start a distribution of hot phonons down a rod with J ¥= 0, the distribution 
will propagate down the rod with J unchanged. Therefore there is no thermal 
resistance. The problem as illustrated in Fig. 16 is like that of the collisions be- 
tween molecules of a gas in a straight tube with frictionless walls. 
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Figure 16d In U processes there is a large net change in phonon momentum in each collision 
event. An initial net phonon flux will rapidly decay as we move to the right. The ends may act as 
sources and sinks. Net energy transport under a temperature gradient occurs as in (b). 



(a) 



(b) 


Figure 17 (a) Normal Kj + K 2 = K 3 and (b) umklapp K x + K 2 = K 3 + G phonon collision 

processes in a two-dimensional square lattice. The square in each figure represents the first 
Brillouin zone in the phonon K space; this zone contains all the possible independent values of the 
phonon wavevector. Vectors K with arrowheads at the center of the zone represent phonons 
absorbed in the collision process; those with arrowheads away from the center of the zone repre- 
sent phonons emitted in the collision. We see in (b) that in the umklapp process the direction of 
the r-component of the phonon flux has been reversed. The reciprocal lattice vector G as shown is 
of length 2 ir/a, where a is the lattice constant of the crystal lattice, and is parallel to the K s axis. 
For all processes, N or U, energy must be conserved, so that a> 1 + o> 2 = o> 3 . 


Umklapp Processes 

The important three-phonon processes that cause thermal resistivity are 
not of the form K x + K 2 = K 3 in which K is conserved, but are of the form 

K, + K 2 = K 3 + G , (47) 

where G is a reciprocal lattice vector (Fig. 17). These processes, discovered by 
Peierls, are called umklapp processes. We recall that G may occur in all mo- 
mentum conservation laws in crystals. In all allowed processes of the form of 
(46) and (47), energy is conserved. 


We have seen examples of wave interaction processes in crystals for which 
the total wavevector change need not be zero, but may be a reciprocal lattice 
vector. Such processes are always possible in periodic lattices. The argument is 
particularly strong for phonons: the only meaningful phonon Ks lie in the first 
Brillouin zone, so that any longer K produced in a collision must be brought 
back into the first zone by addition of a G. A collision of two phonons both 
with a negative value of K x can by an umklapp process ( G + 0}, create a phonon 
with positive K,. Umklapp processes are also called U processes. 

Collisions in which G = 0 are called normal processes or N processes. At 
high temperatures T> 8 all phonon modes are excited because k B T > 

A substantial proportion of all phonon collisions will then be U processes, with 
the attendant high momentum change in the collision. In this regime we can 
estimate the thermal resistivity without particular distinction between N and U 
processes; by the earlier argument about nonlinear effects we expect to find a 
lattice thermal resistivity x T at high temperatures. 

The energy of phonons K 1: K 2 suitable for umklapp to occur is of the order 
of ^k n 0, because each of the phonons 1 and 2 must have wavevectors of the 
order of |G in order for the collision {47) to be possible. If both phonons have 
low K, and therefore low energy, there is no way to get from their collision a 
phonon of wavevector outside the first zone. The umklapp process must con- 
serve energy, just as for the normal process. At low temperatures the number 
of suitable phonons of the high energy § k B 0 required may be expected to vary 
roughly as exp (—0/27’), according to the Boltzmann factor. The exponential 
form is in good agreement with experiment. In summary, the phonon mean 
free path which enters (42) is the mean free path for umklapp collisions be- 
tween phonons and not for all collisions between phonons. 


Imperfections 

Geometrical effects may also be important in limiting the mean free path. 
We must consider scattering by crystal boundaries, the distribution of isotopic 
masses in natural chemical elements, chemical impurities, lattice imperfec- 
tions, and amorphous structures. 

When at low temperatures the mean free path 1 becomes comparable with 
the width of the test specimen, the value of € is limited by the width, and the 
thermal conductivity becomes a function of the dimensions of the specimen. This 
effect was discovered by de Haas and Biermasz. The abrupt decrease in thermal 
conductivity of pure crystals at low' temperatures is caused by the size effect. 

At low temperatures the umklapp process becomes ineffective in limiting 
the thermal conductivity, and the size effect becomes dominant, as shown in 
Fig. 18. One would expect then that the phonon mean free path would be con- 
stant and of the order of the diameter D of the specimen, so that 


K = CvD . 


(48) 
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Figure 18 Thermal conductivity of a 
highly purified crystal of sodium fluo- 
ride, after II. E. Jackson, C. T. Walker, 
and T. F. McNelly. 



Temperature, in K 


Figure 19 Isotope effect on thermal 
conduction in germanium, amounting to 
a factor of three at the conductivity 
maximum. The enriched specimen is 96 
percent Ge 74 , natural germanium is 20 
percent Ge 70 , 27 percent Ge' 2 , 8 percent 
Ge 73 , 37 percent Ge 74 , and 8 percent 
Ge 76 . Below 5 K the enriched specimen 
has K = 0.06 T 3 , which agrees well with 
Casimir’s theory for thermal resistance 
caused by boundary scattering. (After 
T. H. Geballe and G W. Hull.) 


The only temperature-dependent term on the right is C, the heat capacity, 
which varies as T i at low temperatures. We expect the thermal conductivity to 
vary as T 3 at low temperatures. The size effect enters whenever the phonon 
mean free path becomes comparable with the diameter of the specimen. 

Dielectric crystals may have thermal conductivities as high as metals. Syn- 
thetic sapphire (AI 2 0 3 ) has one of the highest values of the conductivity: nearly 




200 W cm 1 K 1 at 30 K. The maximum of the thermal conductivity in sapphire 
is greater than the maximum of 100 W cm -1 KT 1 in copper. Metallic gallium, 
however, has a conductivity of 845 W cm -1 K _1 at 1.8 K. The electronic contri- 
bution to the thermal conductivity of metals is treated in Chapter 6. 

In an otherwise perfect crystal, the distribution of isotopes of the chemical 
elements often provides an important mechanism for phonon scattering. The 
random distribution of isotopic mass disturbs the periodicity of the density as 
seen by an elastic wave. In some substances scattering of phonons by isotopes 
is comparable in importance to scattering by other phonons. .Results for ger- 
manium are shown in Fig. 19. Enhanced thermal conductivity has been ob- 
served also in isotopically pure silicon and diamond; the latter has device 
importance as a heat sink for laser sources. 


Problems 

1. Singularity in density of states, (a) From the dispersion relation derived in Chap- 
ter 4 for a monatomic linear lattice of N atoms with nearest-neighbor interactions, 
show that the density of modes is 


D(o>) = 


2 N 

TT 


(«m 


1 


where ai m is the maximum frequency, (b) Suppose that an optical phonon branch 
has the form a>(K) = <o 0 — AK 2 , near K = 0 in three dimensions. Show that Dim) = 
(L/27r) 3 (2ir/A 3/2 )(a> 0 — (a) m for w < (o 0 and Dim) = 0 for to > w 0 . Here the density 
of modes is discontinuous. 

2. Rms thermal dilation of crystal cell, (a) Estimate for 300 K the root mean 
square thermal dilation AV7V for a primitive cell of sodium. Take the bulk modulus 
as 7 X 10 10 erg cm -3 . Note that the Debye temperature 158 K is less than 300 K, so 
that the thermal energy is of the order of h B T. (b) Use this result to estimate the root 
mean square thermal fluctuation A a/a of the lattice parameter. 

3. Zero point lattice displacement and strain, (a) In the Debye approximation, 

show that the mean square displacement of an atom at absolute zero is (R 2 ) — 
3ha)%/8v 2 pv s , where v is the velocity of sound. Start from the result (4.29) summed 
over the independent lattice modes: (R 2 ) - (fi/2pV)2ta -1 . We have included a factor 
of g to go from mean square amplitude to mean square displacement, (b) Show that 
SoT 1 and (R z ) diverge for a one-dimensional lattice, but that the mean square strain 
is finite. Consider ((dR/dx) 2 ) = as the mean square strain, and show that it is 

equal to htafL/AMlSv 3 ' for a line of N atoms each of mass M, counting longitudinal 
modes only. The divergence of R 2 is not significant for any physical measurement. 

4. Heat capacity of layer lattice, (a) Consider a dielectric crystal made up oflayers 
of atoms, with rigid coupling between layers so that the motion of the atoms 
is restricted to the plane of the layer. Show that the phonon heat capacity in 
the Debye approximation in the low temperature limit is proportional to T 2 . 
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(b) Suppose instead, as in many layer structures, that adjacent layers are very weakly 
bound to each other. What form would you expect the phonon heat capacity to ap- 
proach at extremely low temperatures? 

*5. Griineisen constant, (a) Show that the free energy' of a phonon mode of fre- 
quency to is k s T In [2 sinh (hw/2k H T)\. It is necessary to retain the zero-point energy 
jfiuj to obtain this result, (b) If A is the fractional volume change, then the free en- 
ergy of the crystal may be written as 

F( A, T) = |BA 2 + A- b t 2 In [2 sinh (hw K /2 k s T)\ , 

where B is the bulk modulus. Assume that the volume dependence of tu K is 
Sw/w = — yA, where y is known as the Criineisen constant. If y is taken as indepen- 
dent of the mode K, show that F is a minimum with respect to A when BA = yAlhw 
coth (hoi/2 k B T), and show that this may be written in terms of the thermal energy 
density as A = yU(T)/B. (c) Show that on the Debye model y = —d In 6/d In V. Note: 
Many approximations are involved in this theory: the result (a) is valid only if <n is in- 
dependent of temperature; y may be quite different for different modes. 


*This problem is somewhat difficult. 
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Figure 1 Schematic model of a crystal of sodium metal. The atomic cores are Na + ions: they are 
immersed in a sea of conduction electrons. The conduction electrons are derived from the 3s 
valence electrons of the free atoms. The atomic cores contain 10 electrons in the configuration 
ls 2 2s 2 2p 6 . In an alkali metal the atomic cores occupy a relatively small part ( — 15 percent) of the 
total volume of the crystal, but in a noble metal (Cu, Ag, Au) the atomic cores are relatively larger 
and may be in contact with each other. The common crystal structure at room temperature is 
bcc for the alkali metals and fee for the noble metals. 
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CHAPTER 6: FREE ELECTRON FERMI GAS 


In a theory which has given results like these, 
there must certainly be a great deal of truth. 

H. A. Lorentz 


We can understand many physical properties of metals, and not only of the 
simple metals, in terms of the free electron model. According to this model, tire 
valence electrons of the constituent atoms become conduction electrons and 
move about freely through the volume of the metal. Even in metals for which 
the free electron model works best, the charge distribution of the conduction 
electrons reflects the strong electrostatic potential of the ion cores. The utility 
of the free electron model is greatest for properties that depend essentially on 
the kinetic properties of the conduction electrons. The interaction of the 
conduction electrons with the ions of the lattice is treated in the next chapter. 

The simplest metals are the alkali metals — lithium, sodium, potassium, 
cesium, and rubidium. In a free atom of sodium the valence electron is in a 
3s state; in the metal this electron becomes a conduction electron in the 3.s 
conduction band. 

A monovalent crystal which contains N atoms will have N conduction 
electrons and N positive ion cores. The Na + ion core contains 10 electrons that 
occupy the Is, 2s, and 2p shells of the free ion, with a spatial distribution that 
is essentially the same when in the metal as in the free ion. The ion cores fill 
only about 15 percent of the volume of a sodium crystal, as in Fig. 1. The 
radius of the free Na + ion is 0.98 A, whereas one-half of the nearest-neighbor 
distance of the metal is 1.83 A. 

The interpretation of metallic properties in terms of the motion of free 
electrons was developed long before the invention of quantum mechanics. The 
classical theory had several conspicuous successes, notably the derivation of the 
form of Ohms law and the relation between the electrical and thermal conduc- 
tivity. The classical theory fails to explain the beat capacity and the magnetic 
susceptibility of the conduction electrons. (These are not failures of the free 
electron model, but failures of the classical Maxwell distribution function.) 

There is a further difficulty with the classical model. From many types of 
experiments it is clear that a conduction electron in a metal can move freely in 
a straight path over many atomic distances, undeflected by collisions with 
other conduction electrons or by collisions with the atom cores. In a very pure 
specimen at low temperatures, the mean free path may be as long as 10 8 inter- 
atomic spacings (more than 1 cm). 

Why is condensed matter so transparent to conduction electrons? The 
answer to the question contains two parts: (a) A conduction electron is not 
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deflected by ion cores arranged on a periodic lattice because matter waves can 
propagate freely in a periodic structure, as a consequence of the mathematics 
treated in the following chapter, (b) A conduction electron is scattered only in- 
frequently by other conduction electrons. This property is a consequence of 
the Pauli exclusion principle. By a free electron Fermi gas, we shall mean a 
gas of free electrons subject to the Pauli principle. 


ENERGY LEVELS IN ONE DIMENSION 


Consider a tree electron gas in one dimension, taking account of quantum 
theory and of the Pauli principle. An electron of mass m is confined to a length L 
by infinite barriers (Fig. 2). The wavefunction i/e M (x) of the electron is a solu- 
tion of the Schrodinger equation W.iji — ei <p; with the neglect of potential energy 
we have Di = p 2 /2m, where p is the momentum. In quantum theory p may be 
represented by the operator — ih d/dx, so that 


m. = - 


ft 2 d 2 \jf n 
2 m dx 2 


= 


(1) 


where e„ is the energy of the electron in the orbital. 

We use the term orbital to denote a solution of the wave equation for a 
system of only one electron. The term allows us to distinguish between an 
exact quantum state of the wave equation of a system of N interacting elec- 
trons and an approximate quantum state which we construct by assigning the 
N electrons to N different orbitals, where each orbital is a solution of a wave 
equation for one electron. The orbital model is exact only if there are no inter- 
actions between electrons. 

The boundary' conditions are i/q(0) — 0; >l> n (L) = 0, as imposed by the infi- 
nite potential energy barriers. They are satisfied if the wavefunction is sinelike 
with an integral number n of half-wavelengths between 0 and L: 


I . ■ r 277 

I K = A sin | J- X 


i nA„ = L 


( 2 ) 


where A is a constant. We see that (2) is a solution of (1), because 


= a IfLE ( cm (l 12L X 

dx A \l) COS \L X 

whence the energy' e n is given by 


d'&n A / hit \ . I MT 

sm (T x 


_ Ji?_ I nw 
2m \ L 


( 3 ) 


We want to accommodate N electrons on the line. According to the Pauli 
exclusion principle, no two electrons can have all their quantum numbers 
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Figure 2 First three energy levels and wave- 
functions of a free electron of mass m confined 
to a line of length L. The energy levels arc la- 
beled according to the quantum number n 
which gives the number of half -wavelengths in 
the wavefunction. The wavelengths are indi- 
cated on the wavefunctions. The energy f„ of 
the level of quantum number n is equal to 
(h 2 /2m)(n/2Lf. 


identical. That is, each orbital can be occupied by at most one electron. This 
applies to electrons in atoms, molecules, or solids. 

In a linear solid the quantum numbers of a conduction electron orbital are 
n and m s , where n is any positive integer and the magnetic quantum number 
m s = ±\, according to spin orientation. A pair of orbitals labeled by the quan- 
tum number n can accommodate two electrons, one with spin up and one with 
spin down. 

If there are six electrons, then in the ground state of the system the filled 
orbitals are those given in the table: 


n 

TO, 

Electron 

occupancy 

n 

TO, 

Electron 

occupancy 

i 

t 

i 

3 

t 

i 

i 

i 

i 

3 

4 

i 

2 

T 

i 

4 

T 

0 

2 

i 

i 

4 

4 

0 


More than one orbital may have the same energy. The number of orbitals with 
the same energy is called the degeneracy. 

Let n F denote the topmost filled energy level, where we start filling the 
levels from the bottom (n = 1) and continue filling higher levels with elec- 
trons until all N electrons are accommodated. It is convenient to suppose that 
N is an even number. The condition 2 n F = N determines n F , the value of n for 
the uppermost filled level. 

The Fermi energy e F is defined as the energy of the topmost filled level 
in the ground state of the N electron system. By (3) with n = n F we have in one 
dimension: 


_ ( nur Y _ ( AttA 

F 2 m \ L ) 2m \2L ) 


2 


( 4 ) 


EFFECT OF TEMPERATURE ON THE FERMI-DIRAC DISTRIBUTION 


The ground state is the state of the N electron system at absolute zero. 
What happens as the temperature is increased? This is a standard problem in 
elementary statistical mechanics, and the solution is given by the Fermi-Dirac 
distribution function (Appendix D and TP, Chapter 7). 

The kinetic energy of the electron gas increases as the temperature is in- 
creased: some energy levels are occupied which were vacant at absolute zero, 
and some levels are vacant which were occupied at absolute zero (Fig. .3). The 
Fermi-Dirac distribution gives the probability that an orbital at energy e 
will be occupied in an ideal electron gas in thermal equilibrium: 


/(«) = 77 r1 , 1 ■ (5) 

exp[(e - fi)/k B l j + 1 

The quantity p is a function of the temperature; p is to be chosen for the 
particular problem in such a way that the total number of particles in the system 
comes out correctly — that is, equal to N. At absolute zero p — e F , because in the 
limit T — > 0 the function /(e) changes discontinuously from the value 1 (filled) to 
the value 0 (empty) at e = e F = /a. At all temperatures f(e) is equal to \ when 
e = p, for then the denominator of (5) has the value 2. 



e/kg, in units of 10 4 K 


Figure 3 Fermi- Dime distribution function (5) at the various labelled temperatures, for 
T f = e b /k b = 50,000 K. The results apply to a gas in three dimensions. The total number of parti- 
cles is constant, independent of temperature. The chemical potential /i at each temperature may 
be read off the graph as the energy at which f = 0.5, 
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The quantity fi is the chemical potential (TP, Chapter 5), and we see 
that at absolute zero the chemical potential is equal to the Fermi energy, de- 
fined as the energy of the topmost filled orbital at absolute zero. 

The high energy tail of the distribution is that part for which e — k B T- 
here the exponential term is dominant in the denominator of (5), so 
that /(e) = exp[(ju — e)/k B T]. This limit is called the Boltzmann or Maxwell 
distribution. 


FREE ELECTRON GAS IN THREE DIMENSIONS 


The free-particle Schrodinger equation in three dimensions is 


it 

2m 



T~2 TT ) 'KM = Mr) 
dy dz 


( 6 ) 


If the electrons are confined to a cube of edge L, the wavefunction is the 
standing wave 

ii> n ( T ) = A sin (irn^x/L) sin ( im y y!L ) sin ( 7 m z z/L) , (7) 


where n x , n v n. are positive integers. The origin is at one corner of the cube. 

It is convenient to introduce wavefunctions that satisfy periodic boundary 
conditions, as we did for phonons in Chapter 5. We now require the wavefunc- 
tions to be periodic in x, ij, z with period L. Thus 

+ L , y, z) = 1 j/(x, y, z) , (8) 


and similarly for the y and z coordinates. Wavefunctions satisfying the free- 
particle Schrodinger equation and the periodicity condition are of the form of 
a traveling plane wave: 


i/ffc(r) — exp (ik • r) , 


provided that the components of the wavevector k satisfy 


K = 0 


. 

- L ’ 


.,.47 r 

~L ; 


(9) 

( 10 ) 


and similarly for k y and k z . 

Any component of k of the form 2mr/L will satisfy the periodicity 
condition over a length L, where n is a positive or negative integer. The com- 
ponents of k are the quantum numbers of the problem, along with the 
quantum number m s for the spin direction. We confirm that these values of k x 
satisfy (8), for 


exp[ifc/x + L)] = exp|i2«7r(i' + L)/L] 

— exp(i2n7rx/L) exp(t2mr) = exp(i2n77a/L) = exp(ifcjx) . (II) 
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On substituting (9) in (6) we have the energy e k of the orbital with 
wavevector k: 


= Ib^ 2 = lb (** + K + *?) 


( 12 ) 


2m 2 m vx « 

The magnitude k of the wavevector is related to the wavelength A by k = 2 tt/\ . 

The linear momentum p may be represented in quantum mechanics by 
the operator p = —ihV, whence for the orbital (9) 

P</'k(r) = -ihVi/j k (r) = fikift k (r) , (13) 


so that the plane wave i/» k is an eigenfunction of the linear momentum with the 
eigenvalue hk. The particle velocity in the orbital k is given by v = fik/m. 

In the ground state of a system of N free electrons, the occupied orbitals 
may be represented as points inside a sphere in k space. The energy at the sur- 
face of the sphere is the Fermi energy; the wavevectors at the Fermi surface 
have a magnitude k F such that (Fig. 4): 


e F ~ 



(14) 


From (10) we see that there is one allowed wavevector — that is, one dis- 
tinct triplet of quantum numbers k x , k tJ , k , — for the volume element (2ir/L) 3 of 
k space. Thus in the sphere of volume 47r/c'p/3 the total number of orbitals is 


^ 4vkp/3 
2 ’ (27 r/L) 3 



(15) 


where the factor 2 on the left comes from the two allowed values of the spin 
quantum number for each allowed value of k. Then (15) gives 


k F — 



(16) 


which depends only on the particle concentration. 


Figure 4 In the ground state of a system of N free 
electrons the occupied orbitals of the system fill a 
sphere of radius k F , where e F = is the energy of 

an electron having a wavevector k F . 


k z 



Table 1 Calculated free electron Fermi surface parameters for metals at room temperature 


(Except for Na, K, Rb, Cs at 5 K and Li at 78 K) 


Valency 

Metal 

Electron 
concentration, 
in cm -3 

Radius* 

parameter 

r n 

Fermi 

wavevector, 

in cm -1 

Fermi 
velocity, 
in cm s" 1 

F ermi 
energy', 
in eV 

F ermi 

temperature 

T- f — £*■/&£> 
in deg K 

i 

Li 

4.70 X 10 22 

3.25 

1.11 X 10 8 

1.29 X 10 s 

4.72 

5.48 X 10 4 


Na 

2.65 

3.93 

0.92 

1.07 

3.23 

3.75 


K 

1.40 

4.86 

0.75 

0.86 

2.12 

2.46 


Rb 

1.15 

5.20 

0.70 

0.81 

1.85 

2.15 


Cs 

0.91 

5.63 

0.64 

0.75 

1.58 

1.83 


Cu 

8.45 

2.67 

1.36 

1.57 

7.00 

8.12 


A g 

5.85 

3.02 

1.20 

1.39 

5.48 

6.36 


Au 

5.90 

3.01 

1.20 

1.39 

5.51 

6.39 

2 

Be 

24.2 

1.88 

1.93 

2.23 

14.14 

16.41 


Mg 

8.60 

2.65 

1.37 

1.58 

7.13 

8.27 


Ca 

4.60 

3.27 

1.11 

1.28 

4.68 

5.43 


Sr 

3.56 

3.56 

1.02 

1.18 

3.95 

4.58 


Ba 

3.20 

3.69 

0.98 

1.13 

3.65 

4.24 


Zn 

13.10 

2.31 

1.57 

1.82 

9.39 

10.90 


Cd 

9.28 

2.59 

1.40 

1.62 

7.46 

8.66 

3 

AI 

18.06 

2.07 

1.75 

2.02 

11.63 

13.49 


Ga 

15.30 

2.19 

1.65 

1.91 

10.35 

12.01 


In 

11.49 

2.41 

1.50 

1.74 

8.60 

9.98 

4 

Pb 

13.20 

2.30 

1.57 

1.82 

9.37 

10.87 


Sn(tt) 

14.48 

2.23 

1.62 

1.88 

10.03 

11.64 


’The dimensionless radius parameter is defined as r„ = r 0 /a H , where a u is the first Bohr radius and r 0 is the radius of a sphere that contains one electron, 


Density of orbitals, relative scale 
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D(c) 



e F 


Energy, e 


Figure 5 Density of single-particle states as a func- 
tion of energy, for a free electron gas in three dimen- 
sions. The dashed curve represents the density 
/ (e, T)D(e) of filled orbitals at a finite temperature, 
but such that k B T is small in comparison with e F . The 
shaded area represents the filled orbitals at absolute 
zero. The average energy is increased when the tem- 
perature is increased from 0 to T, for electrons are 
thermally excited from region 1 to region 2. 


Using (14) and (16), 


V I 

(^N\ m 

I Ji 

II 

Vi) 

{ V ) ■ 


(17) 


This relates the Fermi energy to the electron concentration N/V. The electron 
velocity v F at the F ermi surface is 


v F = 





(18) 


Calculated values of k F , v F , and e F are given in Table 1 for selected metals; also 
given are values of the quantity T F which is defined as e F /k B . (The quantity T F 
has nothing to do with the temperature of the electron gas!) 

We now find an expression for the number of orbitals per unit energy 
range, D(e), called the density of states. 1 We use (17) to obtain the total 
number of orbitals of energy 


N 


y / 2me\ a ' 2 


so that the density of states (Fig. 5) is 


D(e) 


dN = V_ ( 2mY 2 1/2 

de 27j 2 \h 2 ) ' e 


(19) 


(20) 


Strictly, D(e) is the density of one-particle states, or density of orbitals. 
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This result may be expressed more simply by comparing (19) and (20) to obtain 
at e 

**>- f=f- (21) 

Within a factor of the order of unit}', the number of orbitals per unit energy 
range at the Fermi energy is the total number of conduction electrons divided 
by the Fermi energy, just as we would expect. 


HEAT CAPACITY OF THE ELECTRON GAS 

The question that caused the greatest difficulty in the early development 
; of the electron theory of metals concerns the heat capacity of the conduction 
I electrons. Classical statistical mechanics predicts that a free particle should 
[ have a heat capacity of | k B , where k B is the Boltzmann constant. If N atoms 
each give one valence electron to the electron gas, and the elections are freely 
| mobile, then the electronic contribution to the heat capacity should be | Nk B , 

\ just as for the atoms of a monatomic gas. But the observed electronic contribu- 
| tion at room temperature is usually less than 0.01 of this value. 

This important discrepancy distracted the early workers, such as Lorentz: 
| How can the electrons participate in electrical conduction processes as if they 
[ were mobile, while not contributing to the heat capacity? The question was 
answered only upon the discovery of the Pauli exclusion principle and the 
Fermi distribution function. Fermi found the correct result and he wrote, 
“One recognizes that the specific heat vanishes at absolute zero and that at low 
temperatures it is proportional to the absolute temperature.” 

When we heat the specimen from absolute zero, not every electron gains 
an energy ~k s T as expected classically, but only those electrons in orbitals 
within an energy range k B T of the Fermi level are excited thermally, as in 
| Fig. 5. This gives an immediate qualitative solution to the problem of the heat 
l capacity of the conduction electron gas. If N is the total number of electrons, 
} only a fraction of the order of T/T F can be excited thermally at temperature T, 
\ because only these lie within an energy range of the order of k B T of the top of 
1 the energy distribution. 

Each of these NT/T F electrons has a thermal energy of the order of k e T. 
The total electronic thermal kinetic energy U is of the order of 

U d ~ ( NT/T F )k B T . (22) 

The electronic heat capacity is given by 

C d = 3U/9T ~ Nk B (T/T F ) (23) 

and is directly proportional to T, in agreement with the experimental 
results discussed in the following section. At room temperature C d is smaller 
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than the classical value iNk^ by a factor of the order of 0.01 or less, for 
T f ~ 5 X 10 4 K. 

We now derive a quantitative expression for the electronic heat capacity 
valid at low temperatures k„T <s e F . The increase A U = U(T) — [7(0) in the 
total energy (Fig- 5) of a system of N electrons when heated from 0 to T is 

de eD(e) . (24) 

Here /(e) is the Fermi-Dirac function (5): 

(24a) 

and D(e) is the number of orbitals per unit energy range. We multiply the 
identity 

N = [ de D(e)f(e) = I de D(e) (25) 

Jo Jo 

by e F to obtain 

[ + f ) de e F f(e)D(e) = f de e F D(e) . (26) 

J o J e F J J o 

We use (26) to rewrite (24) as 

AU = f de{e - e F ) f(e)D[e) + f de(e F - e)[l -f(e)]D(e) . (27) 

J e r J o 

The first integral on the right-hand side of (27) gives the energy needed to 
take electrons from e F to the orbitals of energy e > e F , and the second integral 
gives the energy needed to bring the electrons to e F from orbitals below e F . 
Both contributions to the energy are positive. 

The product f(e)D(e)de in the first integral of (27) is the number of 
electrons elevated to orbitals in the energy range de at an energy e. The factor 
[1 — /(e)] in the second integral is the probability that an electron has been 
removed from an orbital e. The function AU is plotted in Fig. 6. 

The heat capacity’ of the electron gas is found on differentiating AU with 
respect to T. The only temperature-dependent term in (27) is /(e), whence we 
can group terms to obtain 

r lii f" df 

c -“Sr*Jo* (e_eF> 5T C(el ' < 28) 

At the temperatures of interest in metals, r/e F < 0.01, and we see from 
Fig. 3 that (e — e F ) df/dT has large positive peaks at energies near e F . It is a 



d.eeD(e)f(e)- 
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Figure 7 Plot of the chemical potential /r versus temperature as k s T for a gas of noninteracting 
fermions in three dimensions. For convenience in plotting, the units of p and k e T are 0.763e F . 

good approximation to evaluate the density of states D(e) at e F and take it 
outside of the integral: 

f • df 

C el = D(e F ) de(e . (29) 

Examination of the graphs in Figs. 7 and 8 of the variation of the chemical 
potential /a with T suggests that when k B T < e F we ignore the temperature 
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Figure 8 Variation with temperature of the chemical 
potential /i, for free electron Fermi gases in one and 
three dimensions. In common metals r/e F — 0.01 at 
room temperature, so that p. is closely equal to e F . 
These curves were calculated from series expansions 
of the integral for the number of particles in the 
system. 



e F 


dependence of the chemical potential ju. in the Fermi-Dirac distribution func- 
tion and replace /x by the constant e F . We have then, with r = k B T, 


We set 


df _ e — e F exp[(e - e F )/-r] 
dr r 2 (exp[(e - e F )/r ] -I- l) 2 


* = (e - e F )n , 


(30) 

(31) 


and it follows from (29) and (30) that 

C el = k\T D(e e ) f’ dx . (32) 

J -< f ‘t [e + 1)" 

We may safely replace the lower limit by — 00 because the factor e* in the inte- 
grand is already negligible at x — ~€ f /t if we are concerned with low tempera- 
tures such that e F /r ~ 100 or more. The integral in (32) then becomes 



e x 

(e T + l ) 2 


■7T 2 

3 ’ 


whence the heat capacity of an electron gas is 


C d = l **D(eJk*T . 

From (21) we have 


(33) 


(34) 


D(e f ) = 3N/2e F = 3N/2k B T F 

for a free electron gas, with k B T F = e P . Thus (34) becomes 

C sl = iidNk B T/T F . 


(35) 


( 36 ) 
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Recall that although T F is called the Fermi temperature, it is not the electron 
temperature, but only a convenient reference notation. 

Experimental Heat Capacity of Metals 

At temperatures much below both the Debye temperature $ and the 
Fermi temperature T p , the heat capacity' of metals may be written as the sum 
of electron and phonon contributions: C = yT + AT 3 , where y and A are con- 
stants characteristic of the material. The electronic term is linear in T and is 
dominant at sufficiently low temperatures. It is convenient to exhibit the ex- 
perimental values of C as a plot of C/T versus T 2 : 

C/T=y + AJ* , (37) 

for then the points should lie on a straight line with slope A and intercept y. 
Such a plot for potassium is shown in Fig. 9. Observed values of y, called the 
Sommerfeld parameter, are given in Table 2. 

The observed values of the coefficient y are of the expected magnitude, 
but often do not agree very closely with the value calculated for free electrons 
of mass m by use of (17) and (34). It is common practice to express the ratio of 
the observed to the free electron values of the electronic heat capacity as a 
ratio of a thermal effective mass m,|, to the electron mass m , where m th is de- 
fined by the relation 

m th = y(ohserved) 

m y(free) ' ( ' 

This form arises in a natural way because e F is inversely proportional to the 
mass of the electron, whence y « m. Values of the ratio are given in Table 2. 
The departure from unity involves three separate effects: 

• The interaction of the conduction electrons with the periodic potential of 
the rigid crystal lattice. The effective mass of an electron in this potential is 
called the band effective mass. 



Figure 9 Experimental heat capacity values for potassium, plotted as C/T versus T (After 
W. H. Lien and N. E. Phillips.) 




Li 

1.63 

Be 

0.17 

Table 2 Experimental and free electron values of electronic heat 
capacity constant y of metals 

(From compilations kindly furnished by N. Phillips and N. Pearlman. The 
thermal effective mass is defined by Eq. (38).) 

B 

C 

N ! 

0.749 

0.500 




2.18 

0.34 










Na 

1.38 

Mg 

1.3 




Observed y in mJ mo 

.., K -,. 




Al 

1.35 

Si 

P 

1.094 

0.992 

Calculated free electron y in mJ mor ' K - 

0.912 



1.26 

1.3 

m^y/m — (observed y)/(free electron y). 

1.48 



K 

Ca 

Sc 

Ti 

V 

Cr 

Mn(y) 

Fe 

Co 

Ni 

Cu 

Zn 

Ga 

Ge 

As 

2.08 

2.9 

10.7 

3.35 

9.26 

1.40 

9.20 

4.98 

4.73 

7.02 

0.695 

0.64 

0.596 


0.19 

1.668 

1.511 









0.505 

0.753 

1.025 



1.25 

1.9 









1.38 

0.85 

0.58 



Rb 

Sr 

Y 

Zr 

Nb 

Mo 

Tc 

Ru 

Rh 

Pd 

A g 

Cd 

In 

Sn <w) 

Sb 

2.41 

3.6 

10.2 

2.80 

7.79 

2.0 

— 

3.3 

4.9 

9.42 

0.646 

0.688 

1.69 

1.78 

0.11 

1.911 

1.790 



' 






0.645 

0.948 

1.233 

1.410 


1.26 

2.0 









1.00 

0.73 

1.37 

1.26 


Cs 

Ba 

La 

Hf 

Ta 

W 

Re 

Os 

Ir 

Pt 

Au 

Hg(«) 

TI 

Pb 

Bi | 

3.20 

2.7 

10. 

2.16 

5.9 

1.3 

2.3 

2.4 

3.1 

6.8 

0.729 

1.79 

1.47 

2.98 

0.008 

2.238 

1.937 




hMhi 





0.642 

0.952 

1.29 

1.509 


1.43 

1.4 









1.14 

1.88 

1.14 

1.97 
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• The interaction of the conduction electrons with phonons. An electron 
tends to polarize or distort the lattice in its neighborhood, so that the mov- 
ing electron tries to drag nearby ions along, thereby increasing the effective 
mass of the electron. 

• The interaction of the conduction electrons with themselves. A moving elec- 
tron causes an inertial reaction in the surrounding electron gas, thereby in- 
creasing the effective mass of the electron. 

Heavy Fermions. Several metallic compounds have been discovered that have 
enormous values, two or three orders of magnitude higher than usual, of the elec- 
tronic heat capacity constant y. The heavy fermion compounds include UBe 13 , 
CcAl 3 , and CeCu 2 Si 2 . It has been suggested that/ electrons in these compounds 
may have inertial masses as high as 1000 m, because of the weak overlap of wave- 
functions of/electrons on neighboring ions (see Chapter 9, “tight binding”). 


ELECTRICAL CONDUCTIVITY AND OHM’S LAW 


The momentum of a free electron is related to the wavevector by mv = ftk. 
In an electric field E and magnetic field B the force F on an electron of charge 
— e is —e[E + (1 /c)v X B], so that Newton’s second law of motion becomes 


(CGS) 



(39} 


In the absence of collisions the Fermi sphere (Fig. 10) moves in k space at a 
uniform rate by a constant applied electric field. We integrate (39) with B = 0 
to obtain 


k(f) - k(0) - -eEt/h . (40) 

If the force F = — eE is applied at time t = 0 to an electron gas that fills 
the Fermi sphere centered at the origin of k space, then at a later time t the 
sphere will be displaced to a new center at 

5k = -eEt/h . (41) 

Notice that the Fermi sphere is displaced as a whole because every electron is 
displaced by the same Sk. 

Because of collisions of electrons with impurities, lattice imperfections, and 
phonons, the displaced sphere may be maintained in a steady state in an electric 
field. If the collision time is t, the displacement of the Fermi sphere in the 
steady state is given by (41) with t = t. The incremental velocity is v =jfek/ra = 
— eEr/w. If in a constant electric field E there are n electrons of charge q — — e 
per unit volume, the electric current density is 

j = nqv = ne 2 rE/m . 


This is Ohm’s law. 


(42) 
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(a) (b) 


Figure 10 (a) The Fermi sphere encloses the occupied electron orbitals in k space in the ground 
state of the electron gas. The net momentum is zero, because for every orbital k there is an occu- 
pied orbital at — k. (b) Under the influence of a constant force F acting for a time interval f every 
orbital has its k vector increased hy 5k - F t/h. This is equivalent to a displacement of the whole 
Fermi sphere by 8k. The total momentum is NhS k, if there are N electrons present. The applica- 
tion of the force increases the energy of the system by N(hS k) 2 /2m. 


The electrical conductivity cr is defined by j = crE, so by (42) 



The electrical resistivity p is defined as the reciprocal of the conductivity, 
so that 

p = m/ne~T . (44) 

Values of the electrical conductivity and resistivity of the elements are given in 
Table 3. In Gaussian units cr has the dimensions of frequency. 

It is easy to understand the result (43) for the conductivity of a Fermi gas. 
We expect the charge transported to be proportional to the charge density ne; 
the factor elm enters (43) because the acceleration in a given electric field is 
proportional to e and inversely proportional to the mass rn. The time r describes 
the free time during which the field acts on the carrier. Closely the same result 
for the electrical conductivity is obtained for a classical (Maxwellian) gas of elec- 
trons, as realized at low carrier concentration in many semiconductor problems. 

Experimental Electrical Resistivity of Metals 

The electrical resistivity of most metals is dominated at room temperature 
(300 K) hy collisions of the conduction electrons with lattice phonons and at 







Table 3 Electrical conductivity and resistivity of metals at 295 K 

(Resistivity values as given by G. T. Meaden, Electrical resistance of 
metals, Plenum, 1965; residual resistivities have been subtracted.) 


Conductivity in units of 10 s (ohm cm)'"*. 

Resistivity in units of 1CT‘ ohm-cm. 


Al 

3.65 

2.74 


Si 


Cl 


Ne 


Ar 


Sc 

Ti 

V 

Cr 

Mn 

Fe 

Co 

Ni 

Cu 

Zn 

Ga 

0.21 

0.23 

0.50 

0.78 

0.072 

1.02 

1.72 

1.43 

5.88 

1.69 

0.67 

46.8 

43.1 

19.9 

12.9 

139. 

9.8 

5.8 

7.0 

1.70 

5.92 

14.85 

Y 

Zr 

Nb 

Mo 

■ 

Ru 

Rh 

1 

Ag 

Cd 

In 

0.17 

0 24 

0.69 

; 1.89 

BEHI 

1.35 

2.08 

rags 

6.21 

1.38 

1.14 

58.5 

42.4 

14.5 

5.3 

-14. 

7.4 

4.8 

mjjjH 

1.61 

7.27 

8.75 

La 

Hf 

Ta 

W 

Re 

Os 

Ir 

pt 

Au 

Hg liq 

TI 

0.13 

0.33 

0.76 

1.89 

0.54 

1,10 

1.96 

0.96 

4.55 

0.10 

0.61 

79. 

30.6 

13.1 

5.3 

18.6 

9.1 

5.1 

10.4 

2.20 

95.9 

16.4 


Ge 


As 


Se 


Br 


Kr 


Sn (w) 

0.91 

11.0 


Sb 

0.24 

41.3 


Te 


Xe 


Pb 

0,48 

21.0 


Bi 

0.086 

116. 


Po 

0.22 

46. 


At 


Rn 
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(a) (b) 

Figure 11 Electrical resistivity in most metals arises from collisions of electrons with irregulari- 
ties in the lattice, as in (a) by phonons and in (b) by impurities and vacant lattice sites. 


liquid helium temperature (4 K) by collisions with impurity atoms and me- 
chanical imperfections in the lattice (Fig. 11). The rates of these collisions 
are often independent to a good approximation, so that if the electric field 
were switched off the momentum distribution would relax back to its ground 
state with the net relaxation rate 


I 

T 


= J- + I 

Tl T i 


(45) 


where t l and t, are the collision times for scattering by phonons and by imper- 
fections, respectively. 

The net resistivity is given by 

P = PL + Pi , (46) 

where p L is the resistivity caused by the thermal phonons, and p, is the resistiv- 
ity caused by scattering of the electron waves by static defects that disturb the 
periodicity of the lattice. Often p L is independent of the number of defects 
when their concentration is small, and often p ; is independent of temperature. 
This empirical observation expresses Matthiessen’s rule, which is convenient 
in analyzing experimental data (Fig. 12). 

The residual resistivity, p f (0), is the extrapolated resistivity at 0 K because 
p L vanishes as T — > 0. The lattice resistivity, p L (T) = p — p,(0), is the same for 
different specimens of a metal, even though p f ( 0) may itself vary widely. The 
resistivity ratio of a specimen is usually defined as the ratio of its resistivity at 
room temperature to its residual resistivity. It is a convenient approximate in- 
dicator of sample purity: for many materials an impurity in solid solution cre- 
ates a residual resistivity of about 1 p,ohm-cm (1 X 10 -6 ohm-cm) per atomic 
percent of impurity. A copper specimen with a resistivity ratio of 1000 
will have a residual resistivity of 1.7 X 10~ 3 p.ohm-cm, corresponding to an 
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Temperature, K 


Figure 12 Resistance of potassium below 
20 K, as measured on two specimens by 
D. MacDonald and K. Mendelssohn. The 
different intercepts at 0 K are attributed to 
different concentrations of impurities and 
static imperfections in the two specimens. 


| impurity concentration of about 20 ppm. In exceptionally pure specimens the 
; resistivity ratio may be as high as 10 6 , whereas in some alloys (e.g., manganin) 
: it is as low as 1.1. 

It is possible to obtain crystals of copper so pure that their conductivity at 
[■ liquid helium temperatures (4 K) is nearly 10 5 times that at room temperature; 

for these conditions t“ 2X 10 -9 s at 4 K. The mean free path € of a conduc- 
\ tion electron is defined as 

f = v f t , (47) 


where v F is the velocity at the Fermi surface, because all collisions involve only 
electrons near the Fermi surface. From Table 1 we have v F — 1.57 X 10 8 cm s 1 
for Cu, thus the mean free path is f(4 K.) ~ 0.3 cm. Mean free paths as long as 
10 cm have been observed in very pure metals in the liquid helium tempera- 
ture range. 

The temperature-dependent part of the electrical resistivity is proportional 
to the rate at which an electron collides with thermal phonons and thermal elec- 
trons. The collision rate with phonons is proportional to the concentration of 
thermal phonons. One simple limit is at temperatures over the Debye tempera- 
ture 0: here the phonon concentration is proportional to the temperature T, so 
that p T for T > 9. A sketch of the theory is given in Appendix J. 


Umklapp Scattering 

Umklapp scattering of electrons by phonons (Chapter 5) accounts for 
most of the electrical resistivity' of metals at low temperatures. These are 
electron-phonon scattering processes in which a reciprocal lattice vector G is 
involved, so that electron momentum change in the process may be much larger 
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Figure 13 Two Fermi spheres in adjacent 
zones: a construction to show the role of phonon 
umklapp processes in electrical resistivity. 



than in a normal electron-phonon scattering process at low temperatures. (In an 
umklapp process the wavevector of one particle may be “flipped over.”) 

Consider a section perpendicular to [100] through two adjacent Brillouin 
zones in bcc potassium, with the equivalent Fermi spheres inscribed within 
each (Fig. 13). The lower half of the figure shows the normal electron-phonon 
collision k' = k + q, while the upper half shows a possible scattering process 
k' = k + q + G involving the same phonon and terminating outside the first 
Brillouin zone, at the point A. This point is exactly equivalent to the point A' 
inside the original zone, where AA' is a reciprocal lattice vector G. This scat- 
tering is an umklapp process, in analogy to phonons. Such collisions are strong 
scatterers because the scattering angle can be close to it. 

When the Fermi surface does not intersect the zone boundary, there is 
some minimum phonon wavevector q Q for umklapp scattering. At low enough 
temperatures the number of phonons available for umklapp scattering falls 
as exp ( — Ofj/T), where 6 V is a characteristic temperature calculable from the 
geometry of the Fermi surface inside the Brillouin zone. For a spherical Fermi 
surface with one electron orbital per atom inside the bcc Brillouin zone, one 
shows by geometry that q 0 = 0.267 k F . 

The experimental data (Fig. 12) for potassium have the expected exponen- 
tial form with d v = 23 K compared with the Debye 6 = 91 K. At the very low- 
est temperatures (below about 2 K in potassium) the number of umklapp 
processes is negligible and the lattice resistivity is then caused only by small 
angle scattering, which is the normal (not umklapp) scattering. 


MOTION IN MAGNETIC FIELDS 

By the arguments of (39) and (41) we are led to the equation of motion for 
the displacement 8k of a Fermi sphere of particles acted on by a force F and 
by friction as represented by collisions at a rate 1 /t: 


sk=F - 


( 48 ) 
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The free particle acceleration term is ( hd/dt ) 5k and the effect of collisions 
(the friction) is represented by fi8\dr, where r is the collision time. 

Consider now the motion of the system in a uniform magnetic field B. The 
Lorentz force on an electron is 


(CGS) 


F = -e| E + * v X B 


(49) 


(SI) F = — e(E + v X B) 

If m\ = Mk, then the equation of motion is 

(CGS) m (| + ^)v =_e ( E+ JvXB 


(50) 


An important situation is the following: let a static magnetic field B lie 
along the 3 axis. Then the component equations of motion are 


(CGS) 


m 'ji + ^) v * = ~ e ( E * + % v y) ; 


, d , 1 \ _ r B 

m [dt + r v y-- e [ E y-c^ 


(51) 


The results in SI are obtained by replacing c by 1. 

In the steady state in a static electric field the time derivatives are zero, so 
that the drift velocity is 


er ~ 

jjl <J x (*) c TO y 


€T 

Jn Ey (o c tv x 


er r 
= ~m E -- 


(52) 


where to c = eB/mc is the cyclotron frequency, as discussed in Chapter 8 for 
cyclotron resonance in semiconductors. 

Hall Effect 

The Hall field is the electric field developed across two faces of a conduc- 
tor, in the direction j x B, when a current j flows across a magnetic field B. 
Consider a rod-shaped specimen in a longitudinal electric field E x and a trans- 
verse magnetic field, as in Fig. 14. If current cannot flow out of the rod in the 
y direction we must have bv y = 0. From (52) this is possible only if there is a 
transverse electric field 


E y = —(o c tE x = 


£®I e ■ 

me x ’ 


(CGS) 


(53) 
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Magnetic field B. 




-Ex 



in steady state. 



x 


Figure 14 The standard geometry for the Hall effect: a rod-shaped specimen of rectangular 
cross-section is placed in a magnetic field B., as in (a). An electric field E x applied across the end 
electrodes causes an electric current density j x to flow down the rod. The drift velocity of the 
negatively-charged electrons immediately after the electric field is applied as shown in (b). The 
deflection in the — y direction is caused by the magnetic field. Electrons accumulate on one face 
of the rod and a positive ion excess is established on the opposite face until, as in (c), the trans- 
verse electric field (Hall field) just cancels the Lorentz force due to the magnetic field. 


(SI) 


Ey — —o> c tE x = — 


eBr p 
rn 1 • 


The quantity defined by 



( 54 ) 


is called the Hall coefficient. To evaluate it on our simple model we use j x = 
ne 2 TEJm and obtain 


(CGS) 


e.BrEJmc _ \ 

ne 2 rE x B/m ~ nec ’ 


( 55 ) 


(SI) 



This is negative for free electrons, for e is positive by definition. 
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Table 4 Comparison of observed Hall coefficients with free electron theory 

[The experimental values of R H as obtained by conventional methods are summarized from data at 
room temperature presented in the Landolt-Born.stein tables. The values obtained by the helicon 
wave method at 4 K are by J. M. Goodman. The values of the carrier concentration n are from 
Table 1.4 except for Na, K, Al, In, where Goodman’s values are used. To convert the value of R H in 
CGS units to the value in volt-em/amp-gauss, multiply by 9 X 10 11 ; to convert R tI in CGS to 
m 3 /coulomb, multiply by 9 X 10 l0 .] 


Metal 

Method 

Experimental 

in 10 24 CGS units 

Assumed 
carriers 
per atom 

Calculated 

— l/nec, 

in 10 -24 CGS units 

Li 

conv. 

-1.89 

1 electron 

-1.48 

Na 

helicon 

-2.619 

1 electron 

-2.603 


conv. 

-2.3 



K 

helicon 

-4.946 

1 electron 

-4.944 


conv. 

-4.7 



Rb 

conv. 

-5.6 

1 electron 

-6.04 

Cu 

conv. 

-0.6 

1 electron 

-0.82 

A § 

conv. 

-1.0 

1 electron 

-1.19 

Au 

conv. 

-0.8 

1 electron 

-1.18 

Be 

conv. 

+2.7 

— 

— 

Mg 

conv. 

-0.92 

— 

— 

Al 

helicon 

+ 1.136 

1 hole 

+ 1.135 

In 

helicon 

+ 1.774 

1 hole 

+ 1.780 

As 

conv. 

+50. 

— 

— 

Sb 

conv. 

-22. 

— 

— 

Bi 

conv. 

-6000. 

— 

— 


The lower the carrier concentration, the greater the magnitude of the 
Hall coefficient. Measuring R H is an important way of measuring the carrier 
concentration. Note: The symbol R H denotes the Hall coefficient (54), but the 
same symbol is sometimes used with a different meaning, that of Hall resis- 
tance in two-dimensional problems. 

The simple result (55) follows from the assumption that all relaxation 
times are equal, independent of the velocity of the electron. A numerical fac- 
tor of order unity enters if the relaxation time is a function of the velocity. The 
expression becomes somewhat more complicated if both electrons and holes 
contribute to the conductivity. 

In Table 4 observed values of the Hall coefficient are compared with val- 
ues calculated from the carrier concentration. The most accurate measure- 
ments are made by the method of helicon resonance which is treated as a 
problem in Chapter 14. 

The accurate values for sodium and potassium are in excellent agreement 
with values calculated for one conduction electron per atom, using (55). 


Notice, however, the experimental values for the trivalent elements aluminum 
and indium: these agree with values calculated for one positive charge carrier 
per atom and thus disagree in magnitude and sign with values calculated for 
the expected three negative charge carriers. 

The problem of an apparent positive sign for the charge carriers arises 
also for Be and As, as seen in the table. The anomaly of the sign was explained 
by Peierls (1928). The motion of carriers of apparent positive sign, which 
Heisenberg later called “holes,” cannot be explained by a free electron gas, but 
finds a natural explanation in terms of the energy band theory to be developed 
in Chapters 7-9. Band theory also accounts for the occurrence of very large 
values of the Hall coefficient, as for As, Sb, and Bi. 


THERMAL CONDUCTIVITY OF METALS 

In Chapter 5 we found an expression K = 5 CrA for the thermal conductiv- 
ity of particles of velocity v, heat capacity C per unit volume, and mean free 
path f . The thermal conductivity of a Fermi gas follows from (36) for the heat 
capacity, and with e y = , mv \ : 


_ ^ nk%T _ irnklTr 
3 tnvp F 3m 


(56) 


Here i — v f t: the electron concentration is n , and r is the collision time. 

Do the electrons or the phonons carry the greater part of the heat current 
in a metal? In pure metals the electronic contribution is dominant at all tem- 
peratures. In impure metals or in disordered alloys, the electron mean free 
path is reduced by collisions with impurities, and the phonon contribution may 
be comparable with the electronic contribution. 

Ratio of Thermal to Electrical Conductivity 

The Wiedemann-Franz law states that for metals at not too low temper- 
atures the ratio of the thermal conductivity to the electrical conductivity is 
directly proportional to the temperature, with the value of the constant ot 
proportionality independent of the particular metal. This result was important 
in the history of the theory of metals, for it supported the picture of an 
electron gas as the carrier of charge and energy. It can be explained by using 
(43) for cr and (56) for K: 


K _ A%Tnrl?>m = ^ As V T 
a ner^/m 3 \ e J 


( 57 ) 
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Table 5 Experimental Lorenz numbers 


L X 10 8 watt-ohm/deg 2 

L X 10 8 watt-ohm/deg 2 

Metal 

(TC 

locrc 

Metal 

(TC 

KMTC 

A g 

2.31 

2.37 

Pb 

2.47 

2.56 

Au 

2.35 

2.40 

Pt 

2.51 

2.60 

Cd 

2.42 

2.43 

Su 

2.52 

2.49 

Cu 

2.23 

2.33 

w 

3.04 

3.20 

Mo 

2.61 

2.79 

Zn 

2.31 

2.33 


The Lorenz number L is defined as 


L - K/aT , (58) 

and according to (57) should have the value 

L = 7T (it ) 2 = 2 - 72 X 10-13 (erg/esu-deg) 2 

= 2.45 X 10" 8 watt- ohm/deg 2 . (59) 

This remarkable result involves neither n nor m. Experimental values of L at 
0°C and at 100°C as given in Table 5 are in good agreement with (59). 


Problems 

1. Kinetic energy of electron gas. Show that the kinetic energy of a three-dimensional 
gas of N free electrons at 0 K is 

U 0 = lNe F . (60) 

2. Pressure and bulk modulus of an electron gas. (a) Derive a relation connecting 
the pressure and volume of an electron gas at 0 K Hint: Use the result of Problem 
1 and the relation between e f and electron concentration. The result may be writ- 
ten as p — |((7 0 /V). (b) Show that the bulk modulus B = —V(dp/dV) of an electron 
gas at 0 K is B = 5p/3 = 101V9V. (c) Estimate for potassium, using Table 1, the 
value of the electron gas contribution to B. 

3. Chemical potential in two dimensions. Show that the chemical potential of a 
Fermi gas in two dimensions is given by: 

fj.(T) —k B T\n [expi/mihVmkgT) - lj , (61) 

for n electrons per unit area. Note: The density of orbitals of a free electron gas in 
two dimensions is independent of energy: D(e) = m/irfi 2 , per unit area of specimen. 
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4. Fermi gases in astrophysics, (a) Given Mq — 2 X 10 33 g for the mass of the Sun, 
estimate the number of electrons in the Sun. In a white dwarf star this number of 
electrons may be ionized and contained in a sphere of radius 2 X I0 9 cm; find the 
Fermi energy of the electrons in electron volts, (b) The energy of an electron in the 
relativistic limit e i-> me 2 is related to the wavevector as e — pc — hkc. Show that the 
Fermi energy in this limit is he (N/V) 1 , roughly, (c) If the above number of 
electrons were contained within a pulsar of radius 10 km, show that the Fermi en- 
ergy would be 1 0 s eV. This value explains why pulsars are believed to be composed 
largely of neutrons rather than of protons and electrons, for the energy' release in the 
reaction n i- — > p + e~ is only 0.8 X 10 6 eV, which is not large enough to enable many 
electrons to form a Fermi sea. The neutron decay proceeds only until the electron 
concentration builds up enough to create a Fermi level of 0.8 X 10 c eV, at which 
point the neutron, proton, and electron concentrations are in equilibrium. 

5. Liquid He*. The atom He 3 has spin \ and is a fermion. The density of liquid He 3 
is 0.081 g cm -3 near absolute zero. Calculate the Fermi energy' e F and the Fermi 
temperature T F . 

6. Frequency dependence of the electrical conductivity. Use the equation 
midrJdl + u/t) = —eE for the electron drift velocity v to show that the conductivity 
at frequency id is 


where <r!0) = ne 2 T/m. 


cr((o) = rr(0) 


1 + itOT j 
1 + (tar) 2 ) 


(62) 


*7. Dynamic magnetoconductivity tensor for free electrons. A metal with a concen- 
tration n of free electrons of charge —e is in a static magnetic field Bz. The electric 
current density in the xy plane is related to the electric field by 


]x = 'hA + v ul E tJ ; jj = <J, r E T + . 

Assume that the frequency <d> (D c and &» > 1/t, where oi,, = eB/mc and r is the 
collision time, (a) Solve the drift velocity equation (51) to find the components of 
the magnetoconductivity' tensor: 

<r a - tr yy = ia>p4mo ; = - cy = tDMpATUD 2 , 


where dip = Airne'/ni. (b) Note from a Maxwell equation that the dielectric func- 
tion tensor of the medium is related to the conductivity tensor as e = 1 + i(4ir/a>)(r. 
Consider an electromagnetic wave with wavcvcctor k — fcz. Show that the disper- 
sion relation for this wave in the medium is 


c 9 k 2 = - dip ± a) c a)p(o . (63) 

At a given frequency there are two modes of propagation with different wavevec- 
tors and different velocities. The two modes correspond to circularly polarized 


"This problem is somewhat difficult. 
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waves. Because a linearly polarized wave can he decomposed into two circularly 
polarized waves, it follows that the plane of polarization of a linearly polarized wave 
will be rotated by the magnetic field. 

8. Cohesive energy of free electron Fermi gas. We define the dimensionless 
length r 3 , as r 0 /a H , where r n is the radius of a sphere that contains one electron, 
and a H is the bohr radius hr/e 2 m. (a) Show that the average kinetic energy per elec- 
tron in a free electron Fermi gas at 0 K is 2.21/rf, where the energy is expressed in 
rydbergs, with 1 Ry = me 4 /2A 2 . (b) Show that the coulomb energy of a point posi- 
tive charge e interacting with the uniform electron distribution of one electron in 
the volume of radius r 0 is - 3<r/2r 0 , or — 3/r s in rydbergs. (c) Show that the 
coulomb self-energy of the electron distribution in the sphere is 3e 2 /5r 0 , or 6/5 r s in 
rydbergs. (d) The sum of (b) and (c) gives — 1.80/r s for the total coulomb energy 
per electron. Show that the equilibrium value of r s is 2.45. Will such a metal be 
stable with respect to separated II atoms? 

9. Static magnetoconductivity tensor. For the drift velocity theory of (51), show 
that the static current density can be written in matrix form as 



/ 1 

<r a / 

I <o r T 

1 + Kt f y 0 



In the high magnetic field limit of to c r > 1 , show that 


(64) 


ir^ = nec/B — - <r ly 


( 65 ) 


In this limit <x rr = 0, to order 1/&i,t. The quantity cr, /T is called the Hall 
conductivity. 

10. Maximum surface resistance. Consider a square sheet of side L, thickness d, and 
electrical resistivity p. The resistance measured between opposite edges of the 
sheet is called the surface resistance: R sq = pL/JA = p/d , which is independent of 
the area L 2 of the sheet. (R sq is called the resistance per square and is expressed in 
ohms per square, because p/d has the dimensions of ohms.) If we express p by (44), 
then R sq = m/nde 2 T. Suppose now that the minimum value of the collision time is 
determined by scattering from the surfaces of the sheet, so that t = dUv t , where v F 
is the Fermi velocity. Thus the maximum surface resistivity is Ih >q ~ mv F /ndV. 
Show for a monatomic metal sheet one atom in thickness that ** A/e 2 = 4.1 kfl. 


'This problem is somewhat difficult. 
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Insulator Metal Semimetal Semiconductor Semiconductor 

Figure 1 Schematic electron occupancy of allowed energy bands for an insulator, metal, semi- 
metal, and semiconductor. The vertical extent of the boxes indicates the allowed energy regions; 
the shaded areas indicate the regions filled with electrons. In a semimetal (such as bismuth) one 
band is almost filled and another band is nearly empty at absolute zero, but a pure semiconduc- 
tor (such as silicon) becomes an insulator at absolute zero. The left of the two semiconductors 
shown is at a finite temperature, with carriers excited thermally. The other semiconductor is 
electron-deficient because of impurities. 
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CHAPTER 7: ENERGY BANDS 


When I started to think about it, 1 felt that the 
main problem was to explain how the electrons 
could sneak by all the ions in a metal. . . . By 
straight Fourier analysis I found to my delight 
that the wave differed from the plane wave of 
free electrons only by a periodic modulation. 

F. Bloch 


The free electron model of metals gives us good insight into the heat 
capacity, thermal conductivity, electrical conductivity, magnetic susceptibility, 
and electrodynamics of metals. But the model fails to help us with other large 
questions: the distinction between metals, semimetals, semiconductors, and 
insulators; the occurrence of positive values of the Hall coefficient; the rela- 
tion of conduction electrons in the metal to the valence electrons of free 
atoms; and many transport properties, particularly magnetotransport. We need 
a less naive theory, and fortunately it turns out that almost any simple attempt 
to improve upon the free electron model is enormously profitable. 

The difference between a good conductor and a good insulator is striking. 
The electrical resistivity of a pure metal may be as low as 10 10 ohm-cm at a 
temperature of 1 K, apart from the possibility of superconductivity. The resis- 
tivity of a good insulator may be as high as 10 22 ohm-cm. This range of 10 32 
may be the widest of any common physical property of solids. 

Every solid contains electrons. The important question for electrical con- 
ductivity is how the electrons respond to an applied electric field. We shall see 
that electrons in crystals are arranged in energy bands (Fig. 1) separated by- 
regions in energy for which no wavelike electron orbitals exist. Such forbidden 
regions are called energy gaps or band gaps, and result from the interaction 
of the conduction electron waves with the ion cores of the crystal. 

The crystal behaves as an insulator if the allowed energy bands are either 
filled or empty', for then no electrons can move in an electric field. The crystal 
behaves as a metal if one or more bands are partly filled, say between 10 and 
90 percent filled. The crystal is a semiconductor or a semimetal if one or two 
bands are slightly filled or slightly empty. 

To understand the difference between insulators and conductors, we must 
extend the free electron model to take account of the periodic lattice of the solid. 
The possibility of a band gap is the most important new property that emerges. 

We shall encounter other quite remarkable properties of electrons in crys- 
tals. For example, they respond to applied electric or magnetic fields as if the 
electrons w-ere endowed with an effective mass m*, which may be larger or 
smaller than the free electron mass, or may even be negative. Electrons in 
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crystals respond to applied fields as if endowed with negative or positive 
charges, — e or +e, and herein lies the explanation of the negative and positive 
values of the Hall coefficient. 


NEARLY FREE ELECTRON MODEL 


On the free electron model the allowed energy values are distributed es- 
sentially continuously from zero to infinity. We saw in Chapter 6 that 


e k 


It 

2m 


(kl + kl + kl) , 


( 1 ) 


where, for periodic boundary conditions over a cube of side L, 


k x ,k y ,k z = 0 ; 


27 7 

L ; 


477 
L ; 


( 2 ) 


The free electron wavefunctions are of the form 

i/» k (r) = exp(ik • r) ; (3) 

they represent running waves and carry momentum p = Ml. 

The band structure of a crystal can often be explained by the nearly free 
electron model for which the band electrons are treated as perturbed only 
weakly by the periodic potential of the ion cores. This model answers almost 
all the qualitative questions about the behavior of electrons in metals. 

We know that Bragg reflection is a characteristic feature of wave propaga- 
tion in crystals. Bragg reflection of electron waves in crystals is the cause of 
energy gaps. (At Bragg reflection wavelike solutions of the Schrodinger equa- 
tion do not exist, as in Fig. 2.) These energy gaps are of decisive significance in 
determining whether a solid is an insulator or a conductor. 

We explain physically the origin of energy gaps in the simple problem of a 
linear solid of lattice constant a. The low energy portions of the band structure 




Figure 2 (a) Plot of energy e versus wavevector k for a free electron, (b) Plot of energy versus 
wavevector for an electron in a monatomic linear lattice of lattice constant a. The energy gap E g 
shown is associated with the first Bragg reflection at k = ±ir/a\ other gaps are found at higher 
energies at ±mr/a , for integral values of n. 
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are shown qualitatively in Fig. 2, in (a) for entirely free electrons and in (b) for 
electrons that are nearly free, but with an energy gap at k = ±ir/a. The Bragg 
condition (k + G ) 2 — k 2 for diffraction of a wave of wavevector k becomes in 
one dimension 

k- ±1 G = ±nir/a , (4) 

where G = 2 mi /a is a reciprocal lattice vector and n is an integer. The first re- 
flections and the first energy gap occur at k = ± 7 r/a. The region in k space be- 
tween —7 r/a and rrta is the first Brillouin zone of this lattice. Other energy 
gaps occur for other values of the integer n. 

The wavefunetions at k = ±tt/o are not the traveling waves expU7rx/«) or 
exp(— ivx/a) of free electrons. At these special values of k the wavefunetions 
arc made up of equal parts of waves traveling to the right and to the left. When 
the Bragg reflection condition k ~ ± via is satisfied by the wavevector, a wave 
traveling to the right is Bragg-reflected to travel to the left, and vice versa. 
Each subsequent Bragg reflection will reverse the direction of travel of the 
wave. A wave that travels neither to the right nor to the left is a standing wave: 
it doesn’t go anywhere. 

The time-independent state is represented by standing waves. We can form 
I wo different standing waves from the two traveling waves 

exp(± iTrx/a ) = cos(vx/a) ± i sin(vx/a), 

so that the standing waves are 

— exp(t77x/a) + exp(— i-nx/a) = 2 cos (vx/a) ; 

(5) 

i /r(— ) = exp{ivx/a) — exp(— ivx/a) = 2 i sin (vx/a) . 

The standing waves are labeled ( + ) or ( — ) according to whether or not they 
change sign when — x is substituted for x. Both standing waves are composed 
of equal parts of right- and left-directed traveling waves. 

Origin of the Energy Gap 

The two standing waves ij/( + ) and >p{ ~ ) pile up electrons at different 
regions, and therefore the two waves have different values of the potential 
energy in the field of the ions of the lattice. This is the origin of the energy 
gap. The probability density p of a particle is i (i*ip — li/rl 2 . For a pure traveling 
wave exp(ifcx), we have p ~ exp(-ikx) cxp(ikx) — 1, so that the charge density 
is constant. The charge density is not constant for linear combinations of plane 
waves. Consider the standing wave <//( + ) in (5); for this we have 

p( + ) = |*A(+)| 2<X COS 2 77X/fl . 

This function piles up electrons (negative charge} on the positive ions centered 
at x = 0, a, 2 a, . . . in Fig. 3, where the potential energy is lowest. 
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(a) 


p, probability density 



Figure 3 (a) Variation of potential energy of a conduction electron in the field of the ion cores 
of a linear lattice, (b) Distribution of probability density p in the lattice for \ip( — )l 2 x sin 2 wx/a; 
li/'( + )l 2 oc cos 2 m/a; and for a traveling wave. The wavefunction + ) piles up electronic charge 
on the cores of the positive ions, thereby lowering the potential energy in comparison with the 
average potential energy seen by a traveling wave. The wavefunction ip( ~ ) piles up charge in 
the region between the ions, thereby raising the potential energy in comparison with that seen by 
a traveling wave. This figure is the key to understanding the origin of the energy gap. 


Figure 3a pictures the variation of the electrostatic potential energy of a 
conduction electron in the field of the positive ion cores. The ion cores bear a 
net positive charge because the atoms are ionized in the metal, with the va- 
lence electrons taken off to form the conduction band. The potential energy of 
an electron in the field of a positive ion is negative, so that the force between 
them is attractive. 

For the other standing wave ip( — ) the probability density is 
p(-) = | «/r( — ) | 2 oc sin 2 7rx/a , 

which concentrates electrons away from the ion cores. In Fig. 3b we show 
the electron concentration for the standing waves </>( + )> >f/( — ), and for a travel- 
ing wave. 

When we calculate the average or expectation values of the potential 
energy over these three charge distributions, we find that the potential energy 
of p( + ) is lower than that of the traveling wave, whereas the potential energy of 
p( — ) is higher than the traveling wave. We have an energy gap of width E g if 
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the energies of p( — ) and p( + ) differ by E g . Just below the energy gap at 
points A in Fig. 2 the wavefunction is t //( + ), and just above the gap at points B 
the wavefunction is </*(—). 


Magnitude of the Energy Gap 

The wavefunctions at the Brillouin zone boundary k = ir/a are V2 cos irx/a 
and V2 sin m/a, normalized over unit length of line. Let us suppose that the 
potential energy of an electron in the crystal at point x is 

U(x) = U cos 2m/a . 

The first-order energy difference between the two standing wave states is 


Ej, = dx U(x) [|»/'( + )| 2 - |*A{-)| 2 ] 

= 2 j " dx U cos(2m/a)(cos 2 m/a — sin 2 m/a) = U . 


( 6 ) 


We see that the gap is equal to the Fourier component of the crystal potential. 


BLOCH FUNCTIONS 

F. Bloch proved the important theorem that the solutions of the 
Schrodinger equation for a periodic potential must be of a special form: 

(7) 

where u k (r) has the period of the crystal lattice with u k (r) = u k (r + T). Here T 
is a translation vector of the lattice. The result (7) expresses the Bloch theorem: 

The eigenfunctions of the wave equation for a periodic potential are 
the product of a plane wave exp(tk * r) times a function u k (r) with the 
periodicity of the crystal lattice. 

A one-electron wavefunction of the form (7) is called a Bloch function and 
can be decomposed into a sum of traveling waves, as we see later. Bloch func- 
tions can be assembled into wave packets to represent electrons that propa- 
gate freely through the potential field of the ion cores. 

We give now a restricted proof of the Bloch theorem, valid when t/f k is 
nondegenerate; that is, when there is no other wavefunction with the same 
energy and wavevector as i// k . The general case wall be treated later. We con- 
sider N identical lattice points on a ring of length Na. The potential energy is 
periodic in a, with U(x) = U(x + sa), where s is an integer. 

Let us be guided by the symmetry of the ring to look for solutions of the 
wave equation such that 



ijj(x + a) = C(j/(x) , 


(S) 
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where C is a constant. Then, on going once around the ring, 

+ Na) = «^(x) = C s </r(r) , 

because i/dx) must be single-valued. It follows that C is one of the N roots of 
unity, or 

C = exp(i27Ts/lV) ; s = 0, 1, 2, .... A 7 - 1 . (9) 

We use (9) to see that 

ij/(x) — u k (x) exp{i2Trsx/Na) (10) 

satisfies (8), provided that u^x) has the periodicity a, so that u*(x) = u h {x + a). 
This is the Bloch result (7). 


KRONIG-PENNEY MODEL 

A periodic potential for which the wave equation can be solved in terms of 
elementary functions is the square-well array of Fig. 4. The wave equation is 

fi 2 d 2 ilf 

(11) 

where U(x) is the potential energy and e is the energy eigenvalue. 

In the region 0 < x < a in which U — 0, the eigenfunction is a linear 
combination. 


$ = Ae iKx + Be~ iKx , (12) 

of plane waves traveling to the right and to the left, with energy 

e = h 2 K?/2m . (13) 

In the region — b < x < 0 within the barrier the solution is of the form 

<P - Ce Qx + De^ 1 , (14) 

with 

U 0 - e = h z (f/2m . (15) 
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We want the complete solution to have the Bloch form (7). Thus the solu- 
tion in the region a < x < a + b must be related to the solution (14) in the 
region —b < x < 0 by the Bloch theorem: 

i//(a < x < a + h) — tj/( —b < x < 0) e ik< ' a+h ' 1 , ( 16) 

which serves to define the wavevector k used as an index to label the 
solution. 

The constants A, B, C, D are chosen so that <// and dijj/dx are continuous at 
x = 0 and x = a. These are the usual quantum mechanical boundary condi- 
tions in problems that involve square potential wells. Atx = 0, 

A + B = C + D ; (17) 

iK(A - B) — Q(C - D) , (18) 

with Q from (14). At x — a , with the use of (16) for under the barrier in 
terms of i/d— b), 

Ae iKa + Be~ iKa = (Ce~ Qb + I)e Qb ) e ik{a+b] ; { 19) 

iK{Ae iKM - Be ,Ka ) = Q(Ce Qb - De Qb ) e iklM ~ b} . (20) 

The four equations (17) to (20) have a solution only if the determinant of 
the coefficients of A, B, C, D vanishes, yielding 

[(£> 2 — K 2 )/2^)K] sinh Qb sin Ka + cosh Qb cos Ka = cos k(a + b) . (21a) 

It is rather tedious to obtain this equation. 

The result is simplified if we represent the potential by the periodic delta 
function obtained when we pass to the limit b = 0 and C/ 0 = °o in such a way 

that Q 2 ba/ 2 = P, a finite quantity. In this limit Q > K and Qb 1. Then (21a) 

reduces to 

(P/Ko)sin Ka + cos Ka — cos ka . (21b) 

The ranges of K for which this equation has solutions are plotted in Fig. 5, 
for the case P = 37r/2. The corresponding values of the energy are plotted in 
Fig. 6. Note the energy gaps at the zone boundaries. The wavevector k of the 
Bloch function is the important index, not the K in (12), which is related to the 
energy by (13). A treatment of this problem in wavevector space is given later 
in this chapter. 


WAVE EQUATION OF ELECTRON IN A PERIODIC POTENTIAL 

We considered in Fig. 3 the approximate form we expect for the solution 
of the Schrodinger equation if the wavevector is at a zone boundary', as at 
k = ±tt / a. We treat in detail the wave equation for a general potential, at general 
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e 

\ ( P/Ka ) sin Ka + cos Ka 



Figure 5 Plot of the function (P/Ka) sin Ka + cos Ka, for P = 3tt/2. The allowed values of the 
energy e are given by those ranges of Ka = (2me/h l ) vl a for which the function lies between ±1. 
For other values of the energy there are no traveling wave or Bloch-like solutions to the wave 
equation, so that forbidden gaps in the energy spectrum are formed. 


Figure 6 Plot of energy vs. wavenumber for the 
Kronig-Penney potential, with P = 3w/2. Notice 
the energy gaps at ka = tt, 2tt, 3tt ... . 



values of k. Let U(x ) denote the potential energy of an electron in a linear lattice 
of lattice constant a. We know that the potential energy is invariant under a crys- 
tal lattice translation: U(x) = U(x + a). A function invariant under a crystal lattice 
translation may be expanded as a Fourier series in the reciprocal lattice vectors 
G. We write the Fourier series for the potential energy as 

U(.x) = ^U c e iG * . (22) 

G 

The values of the coefficients U G for actual crystal potentials tend to decrease 
rapidly with increasing magnitude of G. For a bare coulomb potential U c 
decreases as 1/G 2 . 
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We want the potential energy U(x) to be a real function: 

U(x) = 2 U c (e iGx + e- iGx ) = 2 2 U c cos Gx . (23) 

G>0 on 

For convenience we have assumed tliat the crystal is symmetric about x = 0 
and that U 0 = 0. 

The wave equation of an electron in the crystal is 'Kip = eip, where is the 
hamiltonian and e is the energy eigenvalue. The solutions ip are called eigen- 
functions or orbitals or Bloch functions. Explicitly, the wave equation is 

(■ 2 + W*)') <PU) = (^P 2 + 2 u c e iGx ^j tp(x) = eipix) . (24) 

Equation (24) is written in the one-electron approximation in which the 
orbital i p(x) describes the motion of one electron in the potential of the ion 
cores and in the average potential of the other conduction electrons. 

The wavefunction ip(x) may be expressed as a Fourier series summed over 
all values of the wavevector permitted by the boundary conditions, so that 

iA - 2 C(k) , (25) 

k 

where k is real. (We could equally well write the index A as a subscript on C, as 
in C k .) 

The set of values of k has the form 27 m/L, because these values satisfy 
periodic boundary conditions over length L. Here n is any integer, positive or 
negative. We do not assume, nor is it generally true, that ip(x) itself is periodic 
in the fundamental lattice translation a. The translational properties of ip(x) 
are determined by the Bloch theorem (7). 

Not all wavevectors of the set 27 m/L enter the Fourier expansion of 
any one Bloch function. If one particular wavevector k is contained in a i p, 
then all other wavevectors in the Fourier expansion of this ip will have the 
form k + G, where G is any reciprocal lattice vector. We prove this result in 
(29) below'. 

We can label a wavefunction < p that contains a component k as ip k or, 
equally well, as i Ai +G , because if k enters the Fourier expansion then k + G 
may enter. The wavevectors k + G running over G are a restricted subset of 
tlie set 2 t m/L, as shown in Fig. 7. 

We shall usually choose as a label for the Bloch function that k which lies 
within the first Brillouin zone. When other conventions are used, we shall say 
so. This situation differs from the phonon problem for a monatomic lattice 
where there are no components of the ion motion outside the first zone. The 
electron problem is like the x-ray diffraction problem because like the electron 
wavefunction the electromagnetic field exists everywhere within the crystal 
and not only at the ions. 
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Figure 7 The lower points represent values of the wavevector k — 2-irn/L allowed by the periodic 
houndary condition on the wavefunction over a ring of circumference L composed of 20 primitive 
cells. The allowed values continue to ±°°. The upper points represent the first few waveveetors 
which may enter into the Fourier expansion of a wavefunction i^(x), starting from a particular 
wavevector k — k, t — — 8(2ir/L). The shortest reciprocal lattice vector is 2w/a — 20(27r/Z,). 


To solve the wave equation, substitute (25) in (24) to obtain a set of linear 
algebraic equations for the Fourier coefficients. The kinetic energy term is 



2 in 



ft 2 

2m dx 2 



k 2 C(k)e ikx ; 


and the potential energy term is 



<Kx) = 'Z2u G e iGx C(k)e ih . 

G k 


The wave equation is obtained as the sum: 

2 ^ k 2 C(k ) e ikx +£ 2 U G C(k) e il - k+C)x = C(k) e ikx . (26) 

k Am c k k 

Each Fourier component must have the same coefficient on both sides of the 
equation. Thus we have the central equation 


(A* - e)C(k) + ^ U a C(k - G) = 0 

G 


(27) 


with the notation 

\ k = fi"k 2 /2m . (28) 

Equation (27) is a useful form of the wave equation in a periodic lattice, 
although unfamiliar because a set of algebraic equations has taken the place of 
the usual differential equation (24). The set appears unpleasant and formida- 
ble because there are, in principle, an infinite number of C(k — G) to be de- 
termined. In practice a small number will often suffice, perhaps two or four. It 
takes some experience to appreciate the practical advantages of the algebraic 
approach. 
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Restatement of the Bloch Theorem 

Once we determine the C’s from (27), the wavcfunction (25) is given as 

4>k(x) = 2 C(* - G) e iik - G)x , (29) 

c 

which may be rearranged as 

<h(x) = (X C(k - G) e~ iG ^j e tbr = e ik ’u k (x) , 
with the definition 

u k (x)^C(k-C)e- iG \ 

G 

Because u k (x) is a Fourier series over the reciprocal lattice vectors, it is in- 
variant under a crystal lattice translation T, so that u k (x ) = u k (x + T). We verify 
this directly by evaluating u k (x + T ): 

u k (x + T) - 2 c ( k - G)e ,CixiT] = e“ iG ' J '[2 C(Jt - G) e~ iCl ] = e~ iGT u k (x) . 

Because exp(“iGT) = 1 by (2.17), it follows that u k {x + T) — u k (x), thereby 
establishing the periodicity of u k . This is an alternate and exact proof of the 
Bloch theorem and is valid even when the >jf k are degenerate. 

Crystal Momentum of an Electron 

What is the significance of the wavevector k used to label the Bloch func- 
tion? It has several properties: 

• Under a crystal lattice translation which carries r to r + T we have 

<A k ( r + T) = e il V kr u k (r + T) - e ik ‘ T </'i(r) , (30) 

because u k (r + T) = u k (r). Thus exp(ik • T) is the phase factor by which a 
Bloch function is multiplied when we make a crystal lattice translation T. 

• If the lattice potential vanishes, the central equation (27) reduces to 
(A k — e)C(k) = 0, so that all C(k — G) are zero except C(k), and thus u k (r) 
is constant. We have i/f k (r) — e ,k r , just as for a free electron. (This assumes 
we have had the foresight to pick the “right” k as the label. For many pur- 
poses other choices of k, differing by a reciprocal lattice vector, will be more 
convenient.) 

• The quantity k enters in the conservation laws that govern collision processes 
in crystals. (The conservation laws are really selection rules for transitions.) 
Thus fik is called the crystal momentum of an electron. If an electron k 
absorbs in a collision a phonon of wavevector q, the selection rule is k + q = 
k' + G. In this process the electron is scattered from a state k to a state k', 
with G a reciprocal lattice vector. Any arbitrariness in labeling the Bloch func- 
tions caii be absorbed in the G without changing the physics of the process. 
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Solution of the Central Equation 

The central equation (27), 

(At - e)C(k) + ^ U c C(k - G) = 0 , (31) 

c 

represents a set of simultaneous linear equations that connect the coefficients 
C(k — G) for all reciprocal lattice vectors G. It is a set because there are as 
many equations as there are coefficients C. These equations are consistent if 
the determinant of the coefficients vanishes. 

Let us write out the equations for an explicit problem. We let g denote the 
shortest G. We suppose that the potential energy U(x) contains only a single 
Fourier component U g = U_ g , denoted by U. Then a block of the determinant 
of the coefficients is given by: 

\ k -2g-e U 0 0 0 

U h-g-e U 0 0 

0 U A* - € U 0 (32) 

0 0 U Ajt + g — e U 

0 0 0 U A i+2g -e 

To see this, write out five successive equations of the set (31). The determi- 
nant in principle is infinite in extent, but it will often be sufficient to set equal 
to zero the portion we have shown. 

At a given k. each root e or e k lies on a different energy' band, except in 
case of coincidence. The solution of the determinant (32) gives a set of energy 
eigenvalues e nk , where n is an index for ordering the energies and k is the 
wavevector that labels C k . 

Most often k will be taken in the first zone, to reduce possible confusion in 
the labeling. If we chose a k different from the original by some reciprocal 
lattice vector, we would have obtained the same set of equations in a different 
order — but having the same energy spectrum. 

Kronig-Penney Model in Reciprocal Space 

As an example of the use of the central equation (31) for a problem that is 
exactly solvable, we use the Kronig-Penney model of a periodic delta-function 
potential: 


U(x) — 2 ^ U c cos Gx — A«2 <5(x — sa) , 

OO ' V 


(33) 


where A is a constant and a the lattice spacing. The sum is over all integers s 
between 0 and 1 la. The boundary conditions are periodic over a ring of unit 
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length, which means over 1/a atoms. Thus the Fourier coefficients of the 
potential are 

U G = dx U(x) cos Gx = Aa 2 dx 5{x — sa ) cos Gx 

J o , Jo ( 34 ) 

=■ cos Gsa = A . 

S 

All U G are equal for the delta-function potential. 

We write the central equation with k as the Bloch index. Thus (31) 
becomes 

(A* — e)C(fc) + A^ C{k — 2xmla) — 0 , (35) 

n 

where A t = h 2 k 2 /2m and the sum is over all integers n. We want to solve (35) 
for e(k). 

We define 


so that (35) becomes 


f(k) — X C(k — Zmi/a) 


C(k) = - ( ^ nA/h ^ f ^ 


k 2 — (2 me/h 2 ) 

Because the sum (36) is over all coefficients C, we have, for any n, 

f(k) = f(k - 2im/a) . 

This relation lets us write 

C(k — limia) = — (2 mA/h 2 )f(k)[(k — 2im/a) 2 — 2 me/h 2 )] 1 


(36) 


(37) 


(38) 

(39) 


We sum both sides over all n to obtain, using (36) and cancelling f(k) from 
both sides. 


( h 2 /2mA ) = — ^ [(k — 2im/a) 2 — (2 me/h 1 )] 1 . (40) 

n 

The sum can be carried out with the help of the standard relation 


ctn x = 2 

n 


1 

JT7T + X 


(41) 


After trigonometric manipulations in which we use relations for the difference 
of two cotangents and the product of two sines, the sum in (40) becomes 


a 2 sin Ka 

4Ka(cos ka — cos Ka) 


where we write K 2 = 2 me/h 2 as in (13). 


(42) 
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The final result for (40) is 

(mAa 2 /2h 2 )(Ka)~ 1 sin Ka + cos Ka = cos ka , (43) 

which agrees with the Kronig-Penney result (21b) with P written for mAa i :2fr. 

Empty Lattice Approximation 

Actual band structures are usually exhibited as plots of energy versus 
wavevector in the first Brillouin zone. When wavevectors happen to be given 
outside the first zone, they are carried hack into the first zone by subtracting a 
suitable reciprocal lattice vector. Such a translation can always be found. The 
operation is helpful in visualization. 

When band energies are approximated fairly well by free electron ener- 
gies e k = fi 2 \c!2m : it is advisable to start a calculation by carrying the free elec- 
tron energies back into the first zone. The procedure is simple enough once 
one gets the hang of it. We look for a G such that a k' in the first zone satisfies 

k ' + G = k , 

where k is unrestricted and is the true free electron wavevector in the empty 
lattice. (Once the plane wave is modulated by the lattice, there is no single 
“true” wavevector for the state i//.) 

If we drop the prime on k' as unnecessary baggage, the free electron 
energy can always be written as 

e(k x jcy,k z ) — (h 2 /2m)(k + G) 2 

= W2m)[(k x + G x f + (k y + G/ + (k z + G z ) 2 ] , 

with k in the first zone and G allowed to run over the appropriate reciprocal 
lattice points. 

We consider as an example the low-lying free electron bands of a simple 
cubic lattice. Suppose we want to exhibit the energy as a function of k in the 
[100] direction. For convenience, choose units such that h 2 /2m = 1. We show 
several low-lying bands in this empty' lattice approximation with dieir energies 
e(000) at k = 0 and e(kjOO) along the k x axis in the first zone: 


Band 

Ga/27T 

e(000) 

e(k s 00) 

i 

000 

0 


2,3 

100,100 

(277/a) 2 

(k x ± 2ir/a) 2 

4, 5, 6, 7 

010,010,001,001 

(2n/a) 2 

kf + {2ir/af 

8,9,10,11 

110,101,110,101 

2(27 T/a) 2 

(k x + 277/a) 2 + (277/a) 2 

12,13,14,15 

110,101,110,101 

2{2ir/af 

(. k x — 2? r/a) 2 + (2tt la) 2 

16,17,18,19 

011,011,011,011 

2(27r/a) 2 

k \ + 2(2it/<z) 2 
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Figure 8 Low-lying free electron energy bands 
of the empty sc lattice, as transformed to the first 
Brillouin zone and plotted vs. (£ x 00). The free 
electron energy is h 2 {k + G) 2 /2 m, where the G s 
are given in the second column of the table. The 
bold curves are in the first BriUouin zone, with 
— 7r/a ^ k x ^ rr/a. Energy bands drawn in this 
way are said to be in the reduced zone scheme. 


These free electron bands are plotted in Fig. 8. It is a good exercise to plot the 
same bands for k parallel to the [111] direction of wavevector space. 

Approximate Solution Near a Zone Boundary 

We suppose that the Fourier components U c of the potential energy are 
small in comparison with the kinetic energy of a free electron at the zone 
boundary. We first consider a wavevector exactly at the zone boundary at § G, 
that is, at tt la. Here 

k 2 = (| G) 2 ; (k - G) 2 = (;G- Gf = (fG) 2 , 

so that at the zone boundary the kinetic energy of the two component waves 
k = ±|G are equal. 

If C'(nG) is an important coefficient in the orbital (29) at the zone boundary, 
then C(~zG) is also an important coefficient. This result also follows from the 
discussion of (5). We retain only those equations in the central equation that 
contain both coefficients C(|G) and C(—\G), and neglect all other coefficients. 
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One equation of (31) becomes, with k = §G and A = h 2 ({;G) 2 /2m, 

(A - e)C(| C) + UC(~\G) = 0 . (44) 

Another equation of (31) becomes, with k = |G, 

(A - e)C(— | G) + 1/C(3 G) = 0 . (45) 

These two equations have nontrivial solutions for the two coefficients if 
the energy e satisfies 

A “ e = 0, (46) 

U A — e 

whence 


(A e = \ ± U ~ ^~(l G) 2 ± 1/ . (47) 

The energy lias two roots, one lower than the free electron kinetic energy by 
[/, and one higher by U. Thus the potential energy 2U cos Gx has created an 
energy gap 2U at the zone boundary'. 

The ratio of the C’s may be found from either (44) or (45): 


G( 2 G) _ e — A _ 
C(\G) V 


(48) 


where the last step uses (47). Thus the Fourier expansion of i//(.y) at the zone 
boundary has the two solutions 

i l>(x) — exp(iGx/2) ± exp( —iC.t/2) . 

These orbitals are identical to (5). 

One solution gives the wavefunction at the bottom of the energy gap; the 
other gives the wavefunction at the top of the gap. Which solution has the 
lower energy' depends on the sign of U. 

We now solve for orbitals with wavevector k near the zone boundary |G. 
We use the same two-component approximation, now with a wavefunction of 
the form 


i/f(x) = C(k ) e ikx + C(k - G) e ilk ~ G)x . (49) 

As directed by the central equation (31), we solve the pair of equations 

(At - e)C(lt) + UC(k - G) = 0 ; 

(Afc-c ~e)C(k -G) + UC(k ) = 0 , 
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with A;- defined as h s k 2 /2m. These equations have a solution if the energy e 
satisfies 



whence e 2 — e(A t _ G + A k ) + A t _ G A U 2 — 0. 

The energy has two roots: 

« = i (A* - o + A*) ± [£ (A* _ c - A*) 2 + C/T 2 , (50) 

and each root describes an energy band, plotted in Fig. 9. It is convenient to 
expand the energy in terms of a quantity K (the mark over the K is called a 
tilde), which measures the difference K = k — §G in wavevector between k 
and the zone boundary: 

e k = (h 2 /2m)(\G 2 +K 2 ) ± [4A(hW/2 m) + lf] m 

- (h 2 /2m){lG z + K 2 ) ± U[1 + 2(A/U i )(ti 2 k 2 /2m)] , (51) 

in the region h 2 GKJ2m <s' |[/|. Here A = (fi 2 /2m)(| G) 2 as before. 

Writing the two zone boundary roots of (47) as e(±), we may write (51) as 



Figure 9 Solutions of (50) in the periodic zone scheme, in the region near a boundary of the first 
Brillouin zone. The units are such that U = — 0.45, G = 2, and fi z /m = 1. The free electron curve is 
drawn for comparison. The energy gap at the zone boundary is 0.90. The value of U has deliberately 
been chosen large for this illustration, too large for the two-term approximation to be accurate. 
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Figure 10 Ratio of the coefficients in i !>(x) = C(k) exp(ifct) + C(k - G) exp[i(fc — G)x] as calcu- 
lated near the boundary of the first Brillouin zone. One component dominates as we move away 
from the boundary. 


These are the roots for the energy when the wavevector is very' close to the 
zone boundary at \G. Note the quadratic dependence of the energy on the 
wavevector K. For U negative, the solution e( — ) corresponds to the upper of 
the two bands, and e( + ) to the lower of the two bands. The two C's are plotted 
in Fig. 10. 


NUMBER OF ORBITALS IN A BAND 


Consider a linear crystal constructed of an even number N of primitive 
cells of lattice constant a. In order to count states we apply periodic boundaiy 
conditions to the wavefunctions over the length of the crystal. The allowed 
values of the electron wavevector k in the first Brillouin zone are given by (2): 


jfc = 0 ; 




; L 


(53) 


We cut the series off at Ntt/L — tt / a, for this is the zone boundary. The point 
—Ntt/L = — ir/a is not to be counted as an independent point because it is 
connected by a reciprocal lattice vector with ir/a. The total number of points is 
exactly 7 N, the number of primitive cells. 

Each primitive cell contributes exactly one independent value of k 
to each energy band. This result carries over into three dimensions. With 
account taken of the two independent orientations of the electron spin, there 
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are 2N independent orbitals in each energy band. If there is a single 
atom of valence, one in each primitive cell, the band can be half filled with 
electrons. If each atom contributes two valence electrons to the band, the 
band can be exactly filled. If there are two atoms of valence, one in each prim- 
itive cell, the band can also be exactly filled. 

Metals and Insulators 

If the valence electrons exactly fill one or more bands, leaving others 
empty, the cry stal wall be an insulator. An external electric field will not cause 
current flow in an insulator. (We suppose that the electric field is not strong 
enough to disrupt the electronic structure.) Provided that a filled band is sepa- 
rated by an energy' gap from the next higher band, there is no continuous way 
to change the total momentum of the electrons if every accessible state is 
filled. Nothing changes when the field is applied. This is quite unlike the situa- 
tion for free electrons for which k increases uniformly in a field (Chapter 6). 

A ervstal can be an insulator onlv if the number of valence electrons in a 

J 

primitive cell of the cry stal is an even integer. (An exception must be made for 
electrons in tightly bound inner shells which cannot be treated by band 
theory.) If a crystal has an even number of v alence electrons per primitive cell, 
it is necessary' to consider whether or not the bands overlap in energy. If the 
bands ov erlap in energy', then instead of one filled band giving an insulator, we 
can have two partly' filled bands giving a metal (Fig. 11). 

The alkali metals and the noble metals have one valence electron per 
primitive cell, so that they' have to be metals. The alkaline earth metals have 
two valence electrons per primitive cell; they could be insulators, but the 
bands overlap in energy' to give metals, but not very good metals. Diamond, 
silicon, and germanium each have two atoms of valence four, so that there are 





(a) 


(b) 


(c) 


Figure 11 Occupied states and band structures giving (a) an insulator, (b) a metal or a semimetal 
because of band overlap, and (c) a metal because of electron concentration. In (b) the overlap 
need not occur along the same directions in the Brillouin zone. If the overlap is small, with rela- 
tively few states involved, we speak of a semimetal. 
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eight valence electrons per primitive cell; the bands do not overlap, and the 
pure crystals are insulators at absolute zero. 


SUMMARY 

• The solutions of the wave equation in a periodic lattice are of the 
Bloch form i// k (r) = e ik r « k (r), where « k (r) is invariant under a crystal lattice 
translation. 

• There are regions of energy for which no Bloch function solutions of the 
wave equation exist (see Problem 5). These energies form forbidden regions 
in which the wavefunctions are damped in space and the values of the k’s are 
complex, as pictured in Fig. 12. The existence of forbidden regions of energy 
is prerequisite to the existence of insulators. 

• Energy bands may often be approximated by one or two plane waves; for 
example, (/^(x) = C{k)e' kx + C(k — G)e' <k ~ G>x near the zone boundary at §G. 

• The number of orbitals in a band is 2 N, where N is the number of primitive 
cells in the specimen. 


Problems 

1. Square lattice, free electron energies, (a) Show for a simple square lattice (two 
dimensions) that the kinetic energy of a free electron at a corner of the first zone is 
higher than that of an electron at midpoint of a side face of the zone by a factor of 2 . 

(b) What is the corresponding factor for a simple cubic lattice (three dimensions)? 

(c) What bearing might the result of (b) have on the conductivity 2 3 4 * * 7 of divalent metals? 

2. Free electron energies in reduced zone. Consider the free electron energy 7 bands 

of an fee crystal lattice in the approximation of an empty lattice, but in the reduced 
zone scheme in which all k' s are transformed to lie in the first Brillouin zone. Plot 
roughly in the [ 111 ] direction the energies of all bands up to six times the lowest 
band energy at the zone boundary at k = (2ir /a)(|, 5 ). Let this be the unit of en- 

ergy. This problem shows why band edges need not necessarily be at the zone cen 
ter. Several of the degeneracies (band crossings) will be removed when account is 
taken of the crystal potential. 

3. Kronig-Penney model, (a) For the delta-function potential and with F -V 1, find at 
k — 0 the energy of the lowest energy band, (b) For the same problem find the band 
gap at k = v/a. 

4. Potential energy in the diamond structure, (a) Show that for the diamond struc- 

ture the Fourier component U c of the crystal potential seen by an electron is equal 

to zero for G = 2A, where A is a basis vector in the reciprocal lattice referred to the 

conventional cubic cell, (b) Show that in the usual first-order approximation to 
the solutions of the wave equation in a periodic lattice the energy 7 gap vanishes at 
the zone boundary plane normal to the end of the vector A. 



7 Energy Bands 


183 



Figure 12 In the energy gap there exist solutions of the wave equation for complex values of the 
wavevector. At the boundary of the first zone the real part of the wavevector is jG. The imaginary 
part of k in the gap is plotted in the approximation of two plane waves, for U — 0.01 h 5 G a /2m. In an 
infinite unbounded crystal the wavevector must be real, or else the amplitude will increase with- 
out limit. But on a surface or at a junction there can exist solutions with complex wavevector. 


*5. Complex wavevectors in the energy gap. Find an expression for the im agin art' 
part of the wavevector in the energy gap at the boundary of the first Brillouin zone, 
in the approximation that led to Eq. (46). Give the result for the Im(fe) at the center 
of the energy gap. The result for small Im(fc) is 

(fi 2 /2m)[Im(Jfc)] 2 « 2mir 1 /h 1 C, 1 . 

The form as plotted in Fig. 12 is of importance in the theory of Zener tunneling 
from one band to another in the presence of a strong electric field. 

6 . Square lattice. Consider a square lattice in two dimensions with the crystal potential 
U(x,y) = —4(7 cos(2-nx/a) cos(2my/a) . 

Apply the central equation to find approximately the energy gap at the comer 
point (wh, 7 r/fl) of the Brillouin zone. It will suffice to solve a 2 X 2 determinantal 
equation. 


‘This problem is somewhat difficult. 
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Figure 1 Carrier concentrations for metals, semimetals, and semiconductors. The semiconductor 
range may be extended upward by increasing the impurity concentration, and the range can be ex- 
tended downward to merge eventually with the insulator range. 



CHAPTER 8: SEMICONDUCTOR CRYSTALS 


Carrier concentrations representative of metals, semimetals, and semicon- 
ductors are sir own in Fig. 1. Semiconductors are generally classified by their 
electrical resistivity at room temperature, with values in the range of 10 " to 
10 9 ohm-cm, and strongly dependent on temperature. At absolute zero a pure, 
perfect crystal of most semiconductors will be an insulator, if we arbitrarily de- 
fine an insulator as having a resistivity above 10 14 ohm-cm. 

Devices based on semiconductors include transistors, switches, diodes, 
photovoltaic cells, detectors, and thermistors. These may be used as single 
circuit elements or as components of integrated circuits. We discuss in this 
chapter the central physical features of the classical semiconductor crystals, 
particularly silicon, germanium, and gallium arsenide. 

Some useful nomenclature: the semiconductor compounds of chemical 
formula AS, where A is a trivalent element and B is a pentavalent element, are 
called 11I-V (three-five) compounds. Examples are indium antimonide and 
gallium arsenide. Where A is divalent and B is hexavalent, the compound is 
called a II-VI compound; examples are zinc sulfide and cadmium sulfide. Silicon 
and germanium are sometimes called diamond- type semiconductors, because 
they have the crystal structure of diamond. Diamond itself is more an insulator 
rather than a semiconductor. Silicon carbide SiC is a IV-IV compound. 

A highly purified semiconductor exhibits intrinsic conductivity, as distin- 
guished from the impurity conductivity of less pure specimens. In the intrin- 
sic temperature range the electrical properties of a semiconductor are not 
essentially modified by impurities in the crystal. An electronic band scheme 
leading to intrinsic conductivity is indicated in Fig. 2. The conduction band is 
vacant at absolute zero and is separated by an energy gap £ g from the filled 
valence band. 

The band gap is the difference in energy' between the lowest point of the 
conduction band and the highest point of the valence band. The lowest point 
in the conduction band is called the conduction band edge; the highest 
point in the valence band is called the valence band edge. 

As the temperature is increased, electrons are thermally excited from the 
valence band to the conduction band (Fig. 3). Both the electrons in the con- 
duction band and the vacant orbitals or holes left behind in the valence band 
contribute to the electrical conductivity. 


BAND GAP 


The intrinsic conductivity' and intrinsic carrier concentrations are largely 
controlled by Eg/k B T, the ratio of the band gap to the temperature. When this 
ratio is large, the concentration of intrinsic carriers will be low and the 
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Figure 2 Band scheme for intrinsic conductivity in a semiconductor. At 0 K the conductivity is 
zero because all states in the valence band are filled and all states in the conduction band are va- 
cant. As the temperature is increased, electrons are thermally excited from the valence band to the 
conduction band, where they become mobile. Such carriers are called “intrinsic.” 



Figure 3 Intrinsic electron concentration as a function of temperature for (a) germanium and 
(b) silicon. Under intrinsic conditions the hole concentration is equal to the electron concentra- 
tion. The intrinsic concentration at a given temperature is higher in Ge than in Si because the 
energy gap is narrower in Ge (0.66 eV) than in Si (1.11 eV). (After W. C. Dunlap.) 


conductivity will be low. Band gaps of representative semiconductors are given 
in Table 1. The best values of the band gap are obtained by optical absorption. 

In a direct absorption process the threshold of continuous optical ab- 
sorption at frequency a> g measures the band gap E„ = h<o g as shown in Figs. 4a 
and 5a. A photon is absorbed by the crystal with the creation of an electron 
and a hole. 

In the indirect absorption process in Figs. 4b and 5b the minimum 
energy gap of the band structure involves electrons and holes separated by a 
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CRYSTAL WITH DIRECT GAP CRYSTAL WITH INDIRECT GAP 


Absorption 



Photon energy h(o — *• 
(a) 


Absorption 



Photon energy hco — *■ 

(b) 


Figure 4 Optical absorption in pure insulators at absolute zero. In (a) the threshold determines 
the energy gap as E g = fko g . In (b) the optical absorption is weaker near the threshold: at 
ha) = E„ + Ml a photon is absorbed with the creation of three particles: a free electron, a free 
hole, and a phonon of energy Ml. In (b) the energy E vert marks the threshold for the creation of a 
free electron and a free hole, with no phonon involved. Such a transition is called vertical; it is 
similar to the direct transition in (a). These plots do not show absorption lines that sometimes are 
seen lying just to the low energy side of the threshold. Such lines are due to the creation of a 
bound electron-hole pair, called an exciton. 


e € 



Figure 5 In (a) the lowest point of the conduction band occurs at the same value of k as the highest 
point of the valence band. A direct optical transition is drawn vertically with no significant change of 
k, because the absorbed photon has a very small wavevector. The threshold frequency a t g for absorp- 
tion by the direct transition determines the energy gap E g = h<a R . The indirect transition in (b) in- 
volves both a photon and a phonon because the band edges of the conduction and valence bands are 
widely separated in k space. The threshold energy for the indirect process in (b) is greater than the 
true band gap. The absorption threshold for the indirect transition between the band edges is at 
fUo = £ g + hSl, where fl is the frequency of an emitted phonon of wavevector K = — k g . At higher 
temperatures phonons are already present; if a phonon is absorbed along with a photon, the thresh- 
old energy is fun = E g — Ml. Note: The figure shows only the threshold transitions. Transitions occur 
generally between almost all points of the two bands for which the wavevectors and energy can be 
conserved. 



Table 1 Energy- gap between the valence and conduction bands 
(t = indirect gap; d — direct gap) 


Crystal 

Gap 


eV 

Crystal 

Gap 


eV 

OK 

300 K 

0K 

300 K 

Diamond 

i 

5.4 


SiC(hex) 

i 

3.0 



Si 

i 

1.17 

111 

Tc 

d 

0.33 

— 

Ge 

i 

0.744 

0.66 

HgTe‘ 

d 

-0.30 


«Sn 

d 

0.00 

0.00 

PbS 

d 

0.286 

0.34-0.37 

InSb 

d 

0.23 

0.17 

PbSe 

i 

0.165 

0.27 

InAs 

A 

0.43 

0.36 

PbTe 

i 

0.190 

0.29 

InP 

d 

1.42 

1.27 

CdS 

d 

2.582 

2.42 

GaP 

i 

2.32 

2.25 

CdSe 

d 

1.840 

1.74 

GaAs 

d 

1.52 

1.43 

CdTe 

d 

1.607 

1.44 

GaSb 

d 

0.81 

0.68 

SnTe 

d 

0.3 

0.18 

AlSb 

i 

1.65 

1.6 

Cu 2 0 

d 

2.172 

— 


11/ IV is a semimetal: the bands overlap. 


substantial wavevector k f . Here a direct photon transition at the energy of the 
minimum gap cannot satisfy the requirement of conservation of wavevector, 
because photon wavevectors are negligible at the energy range of interest. But 
if a phonon of wavevector K and frequency fl is created in the process, then 
we can have 


k(photon) = kj. + K = 0 ; fioi = E g + Ml , 

as required by the conservation laws. The phonon energy Ml wall generally be 
much less than E g : a phonon even of high wavevector is an easily accessible 
source of crystal momentum because the phonon energies are characteristi- 
cally small (--0.01 to 0.03 eV) in comparison with the energy gap. If the tem- 
perature is high enough that the necessary phonon is already thermally excited 
in the crystal, it is possible also to have a photon absorption process in which 
the phonon is absorbed. 

The band gap may also be deduced from the temperature dependence 
of the conductivity or of the carrier concentration in the intrinsic range. The 
carrier concentration is obtained from measurements of the Hall voltage 
(Chapter 6), sometimes supplemented by conductivity' measurements, Optical 
measurements determine whether the gap is direct or indirect. The band 
edges in Ge and in Si are connected by indirect transitions; the band edges in 
InSb and GaAs are connected by a direct transition (Fig. 6). The gap in a- Sri is 
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Figure 6 Optical absorption in pure indium antimonide, InSb. The transition is direct because 
both conduction and valence band edges are at the center of the Brillouin zone, k = 0. Notice the 
sharp threshold. (After G. W Gobeli and H, Y. Fan.) 


direct and is exactly zero; HgTe and HgSe are semiinetals and have negative 
gaps — the conduction and valence bands overlap. 


EQUATIONS OF MOTION 

We derive the equation of motion of an electron in an energy band. We 
look at the motion of a wave packet in an applied electric field. Suppose that 
the wave packet is made up of wavefunctions assembled near a particular 
wavevector k. The group velocity by definition is u„ — dco/dk. The frequency as- 
sociated with a wavefunction of energy e by quantum theory is w = e/h , and so 

v g — h~ 1 de/dk or v = ft -1 V k e(k) . (1) 

The effects of the crystal on the electron motion are contained in the disper- 
sion relation e(k). 
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The work Se done on the electron by the electric field E in the time 
interval St is 


Se = —eEv g St . (2) 

We observe that 

Se = ( de/dk)Sk = hv g 8k , (3) 

using (1). On comparing (2) with (3) we have 

8k = —{eE/h)8t , (4) 


whence hdk/dt = —eE. 

We may write (4) in terms of the external force F as 



(5) 


This is an important relation: in a crystal hdk/dt is equal to the external force 
on the electron. In free space d{m\)/dt is equal to the force. We have not over- 
thrown Newton’s second law of motion: the electron in the crystal is subject to 
forces from the crystal lattice as well as from external sources. 

The force term in (5) also includes the electric field and the Lorentz force 
on an electron in a magnetic field, under ordinary conditions where the mag- 
netic field is not so strong that it breaks down the band structure. Thus the 
equation of motion of an electron of group velocity v in a constant magnetic 
field B is 


(CGS) B ; 

dt c 


(SI) h~= —ev X B 
dt 


( 6 ) 


where the right-hand side of each equation is the Lorentz force on the electron. 
With the group velocity v = ft _1 grad k e, the rate of change of the wavevector is 

(CGS) g V k e X B ; (SI) || = - J V k t X B (7) 

where now both sides of the equation refer to the coordinates in k space. 

We see from the vector cross-product in (7) that in a magnetic field 
an electron moves in k space in a direction normal to the direction of the gra- 
dient of the energy e, so that the electron moves on a surface of constant 
energy. The value of the projection k B of k on B is constant during the 
motion. The motion in k space is on a plane normal to the direction of B, and 
the orbit is defined by the intersection of this plane with a surface of constant 
energy. 
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Physical Derivation of ftk = F 

We consider the Bloch eigenfunction i (i k belonging to the energy eigen- 
value e k and wavevector k: 

tp k = X C(k + G) exp[i(k + G) • r] . (8) 

G 

The expectation value of the momentum of an electron in the Bloch state k is 
P el = (k|— i*V|k) = 2 ft(k + G)|C(k + G)| 2 = M k + 2 G|C(k + G)| 2 ) , 

G G 

(9) 


using 2|C(k + G)| 2 = 1. 

We examine the transfer of momentum between the electron and the lat- 
tice when the state k of the electron is changed to k + Ak by the application 
of an external force. We imagine an insulating crystal electrostatically neutral 
except for a single electron in the state k of an otherwise empty band. 

We suppose that a weak external force is applied for a time interval such 
that the total impulse given to the entire crystal system is J — /F dt. If the 
conduction electron were free (m* = m ) . the total momentum imparted to 
the crystal system by the impulse would appear in the change of momentum of 
the conduction electron: 


J = Ap t „t = Ap d = ftAk . (10) 

The neutral crystal suffers no net interaction with the electric field, either 
directly or indirectly through the free electron. 

If the conduction electron interacts with the periodic potential ot the crys- 
tal lattice, we must have 

J = Ap, 0l = Ap ht + Ap„i . (11) 

From the result (9) for p el we have 

A Pe , = f>A k + 2 fiG[(V k |C(k + G)| 2 ) • Ak] . (12) 

G 

The change Ap|., t in the lattice momentum resulting from the change of 
state of the electron may be derived by an elementary physical consideration. 
An electron reflected by the lattice transfers momentum to the lattice. If an 
incident electron with plane wave component of momentum ftk is reflected 
with momentum A(k -I- G), the lattice acquires the momentum — hG, as re- 
quired by momentum conservation. The momentum transfer to the lattice 
when the state ipv goes over to ^ Ak is 

A Plat = G[(V k |C(k + G)| 2 • Ak] , 

G 


( 13 ) 



because the portion 

V k |C(k + G)| i. 2 *Ak (14) 

of each individual component of the initial state is reflected during the state 
change Ak. 

The total momentum change is therefore 

Ap„i + Ap )ilt = J = ftAk , (15) 

exactly as for free electrons, Eq. (10). Thus from the definition of J, we have 

fidk/dt = F , (16) 

derived in (5) by a different method. A rigorous derivation of (16) by an en- 
tirely different method is given in Appendix E. 

Holes 

The properties of vacant orbitals in an otherwise filled band are important 
in semiconductor physics and in solid state electronics. Vacant orbitals in a 
band are commonly called holes, and without holes there would be no transis- 
tors. A hole acts in applied electric and magnetic fields as if it has a positive 
charge +e. The reason is given in five steps in the boxes that follow. 


i. h = ~K ■ (17) 

The total wavevector of the electrons in a filled band is zero: 2k = 0, 
where the sum is over all states in a Brillouin zone. This result follows 
from the geometrical symmetry of the Brillouin zone: every fundamental 
lattice type has symmetry under the inversion operation r— > — r about 
any lattice point; it follows that the Brillouin zone of the lattice also has 
inversion symmetry. If the band is filled all pairs of orbitals k and — k are 
filled, and the total wavevector is zero. 

If an electron is missing from an orbital of wavevector k ( , the total 
wavevector of the system is — k c and is attributed to the hole. This result 
is surprising: the electron is missing from k e and the position of the hole 
is usually indicated graphically as situated at k„, as in Fig. 7. But the true 
wavevector lq, of the hole is — k c , which is the wavevector of the point G 
if the hole is at E. The wavevector — k e enters into selection rules for 
photon absorption. 

The hole is an alternate description of a band with one missing elec- 
tron, and we either say that the hole has wavevector — k s , or that the band 
with one missing electron has total wavevector — k e . 
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Figure 7 Absorption of a photon of energy hoi and negligible wavevector takes an electron from 
E in the filled valence band to Q in the conduction band. If k„ was the wavevector of the electron 
at £, it becomes the wavevector of the electron at Q. The total wavevector of the valence band 
after the absorption is — k^, and this is the wavevector we must ascribe to the hole if we describe 
the valence band as occupied by one hole. Thus k (l = — k c ; the wavevector of the hole is the same 
as the wavevector of the electron which remains at G. For the entire system the total wavevector 
after the absorption of the photon is k, + k (l = 0, so that the total wavevector is unchanged by the 
absorption of the photon and the creation of a free electron and free hole. 


2. e h (k h ) = — e e (k e ) . (18) 

Here the zero of energy of the valence band is at the top of the band. 
The lower in the band the missing electron lies, the higher the energy of 
the system. The energy of the hole is opposite in sign to the energy of 
the missing electron, because it takes more work to remove an electron 
from a low orbital than from a high orbital. Thus if the band is symmet- 
ric, 1 e e (k e ) = e e (-k e ) = -e ;i (-k e )= -e h (k h ). We construct in Fig. 8 a 
band scheme to represent the properties of a hole. This hole band is a 
helpful representation because it appears right side up. 

3. v fc = v e . (19) 

The velocity of the hole is equal to the velocity of the missing electron. 
From Fig. 8 we see that Ve/,(k h ) = Ve e (fc e ), so that Vh(k h ) = v e (k e ). 


1 Bands are always symmetric under the inversion k — » — k if the spin-orbit interaction is 
neglected. Even with spin-orbit interaction, bands are always symmetric if the crystal structure 
permits the inversion operation. Without a center of symmetry, but with spin-orbit interaction, the 
bands are symmetric if we compare subbands for which the spin direction is reversed: e(k, T) = 
e(—k, f). See QTS, Chapter 9. 
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Figure 8 The upper half of the figure shows the hole band that simulates the dynamics of a hole, 
constructed by inversion of the valence band in the origin. The wavevector and energy of the hole 
are equal, but opposite in sign, to the wavevector and energy of the empty electron orbital in the va- 
lence band. We do not show the disposition of the electron removed from the valence band at k„. 


4. m h = -m e . (20) 

We show below that the effective mass is inversely proportional to the 
curvature d 2 e/dk 2 , and for the hole band this has the opposite sign to that 
for an electron in the valence band. Near the top of the valence band m e 
is negative, so that m h is positive. 

5. n W = e{ E+ |v;,XB). (21) 

This comes from the equation of motion 

r/k l 

(CGS) h-r = -e(E +^v e X B) (22) 

dt v 

that applies to the missing electron when we substitute — k ; , for k ( , and v h 

for v c . The equation of motion for a hole is that of a particle of 

positive charge e. The positive charge is consistent with the electric 
current carried by the valence band of Fig. 9: the current is carried by 
the unpaired electron in the orbital G: 

j = (-e)v(G) =(—<?)[— v(E)] = e\(E) , (23) 

which is just the current of a positive charge moving with the velocity as- 
cribed to the missing electron at E. The current is shown in Fig. 10. 
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(a) (b) (c) 


Figure 9 (a) At t = 0 all states are filled except F at the top of the band; the velocity v x is zero at F 
because de/dk x = 0. (b) An electric field E x is applied in the +x direction. The force on the elec- 
trons is in the —k x direction and all electrons make transitions together in the —k x direction, mov- 
ing the hole to the state E. (c) After a further interval the electrons move farther along in k space 
and the hole is now at D. 



Figure 10 Motion of electrons in the conduction band and 
holes in the valence band in the electric field E. The hole 
and electron drift velocities are in opposite directions, but their 
electric currents are in the same direction, the direction of the 
electric field. 


Effective Mass 

When we look at the energy-wavevector relation e = (h 2 /2m)k 2 for free 
electrons, we see that the coefficient of k 2 determines the curvature of e versus 
k. Turned about, we can say that 1/m, the reciprocal mass, determines the cur- 
vature. For electrons in a band there can be regions of unusually high curva- 
ture near the band gap at the zone boundary, as we see from the solutions in 
Chapter 7 of the wave equation near the zone boundary. If the energy gap is 
small in comparison with the free electron energy A at the boundary, the cur- 
vature is enhanced by the factor A/E g . 

In semiconductors the band width, which is like the free electron energy, 
is of the order of 20 eV, while the band gap is of the order of 0.2 to 2 eV. Thus 
the reciprocal mass is enhanced by a factor 10 to 100, and the effective mass is 
reduced to 0.1-0.01 of the free electron mass. These values apply near the 
band gap; as we go away from the gap the curvatures and the masses are likely 
to approach those of free electrons. 

To summarize the solutions of Chapter 7 for U positive, an electron near 
the lower edge of the second band has an energy that may be written as 
e(K) = e c + (h 2 /2m e )K 2 ; mjm = l/[(2A/l/) — 1] . 


( 24 ) 
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Here K is the wavevector measured from the zone boundary, and rn e denotes 
the effective mass of the electron near the edge of the second hand. An elec- 
tron near the top of the first band has the energy 

e(K) = e„ - (h 2 /2m h )K 2 ; m k /m = 1/[(2A/I7) + 1] . (25) 


The curvature and hence the mass will be negative near the top of the first 
band, but we have introduced a minus sign into (25) in order that the symbol 
m h for the hole mass will have a positive value — see (20) above. 

The crystal does not weigh any less if the effective mass of a carrier is less 
than the free electron mass, nor is Newton’s second law violated for the crystal 
taken as a whole, ions plus carriers. The important point is that an electron in a 
periodic potential is accelerated relative to the lattice in an applied electric or 
magnetic field as if the mass of the electron were equal to an effective mass 
which we now define. 

We differentiate the result (1) for the group velocity to obtain 


_ *-i d 2 e _ > -i ( d 2 e dk \ 
dt dkdt \dk 2 dt j ' 


(26) 


We know from (5) that dk/dt = F/h, whence 

^ f= h 2 dl \ 

dt Wdk 2 ) ’ ° r d-e/dk 2 dt 


(27) 


If we identify h : /(cPe/dk 2 ) as a mass, then (27) assumes the form of Newton’s 
second law. We define the effective mass m* by 


(28) 


1 _ 1 d 2 e 
m* h 2 dk 2 ' 


It is easy to generalize this to take account of' an anisotropic electron en- 
ergy surface, as for electrons in Si or Ge. We introduce the components of the 
reciprocal effective mass tensor 


/ ] \ 1 d 2 e l dv^ _ / j 

\ m *)w fP dk^ dk v ’ dt 


(29) 


where fi, v are Cartesian coordinates. 


Physical Interpretation of the Effective Mass 

How can an electron of mass rn when put into a crystal respond to applied 
fields as if the mass were m*? It is helpful to think of the process of Bragg re- 
flection of electron waves in a lattice. Consider the weak interaction approxi- 
mation treated in Chapter 7. Near the bottom of the lower band the orbital is 
represented quite adequately by a plane wave exp(ifc.x) with momentum hk; 
the wave component exp[i(& — G)r] with momentum h{k—G) is small and 
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(a) (b) 

Figure 11 Explanation of negative effective masses which occur near, but below, a Brillouin zone 
boundary. In (a) the energy of the electron beam incident on a thin crystal is slightly too low to sat- 
isfy the condition for Bragg reflection and the beam is transmitted through the crystal. The appli- 
cation of a small voltage across the grid may, as in (b), cause the Bragg condition to be satisfied, 
and the electron beam will then be reflected from the appropriate set of crystal planes. 


increases only slowly as k is increased, and in this region m* — m. An increase 
in the reflected component exp[i(/c — G)x] as k is increased represents mo- 
mentum transfer to the electron from the lattice. 

Near the boundary the reflected component is quite large; at the bound- 
ary it becomes equal in amplitude to the forward component, at which point 
the eigenfunctions are standing waves, rather than running waves. Here the 
momentum component h(—\G ) cancels the momentum component h{\ G ). 

A single electron in an energy band may have positive or negative effective 
mass: the states of positive effective mass occur near the bottom of a band be- 
cause positive effective mass means that the band has upward curvature 
( d 2 e/dk 2 is positive). States of negative effective mass occur near the top of the 
band. A negative effective mass means that on going from state k to state 
k + A k, the momentum transfer to the lattice from the electron is larger than 
the momentum transfer from the applied force to the electron. Although k is 
increased by A k by the applied electric field, the approach to Bragg reflection 
can give an overall decrease in the forward momentum of the electron; when 
this happens the effective mass is negative (Fig. 11). 

As we proceed in the second band away from the boundary, the amplitude 
of exp[i(k — G).x] decreases rapidly and m* assumes a small positive value. 
Here the increase in electron velocity resulting from a given external impulse 
is larger than that which a free electron would experience. The lattice makes 
up the difference through the reduced recoil it experiences when the ampli- 
tude of exp[i(k — G)x] is diminished. 

If the energy in a band depends only slightly on k, then the effective mass 
will be very large. That is, m*/m > 1 when cPe/dk 2 is very small. The tight- 
binding approximation discussed in Chapter 9 gives quick insight into the for- 
mation of narrow bands. If the wavefunctions centered on neighboring atoms 
overlap very little, then the overlap integral is small; the width of the band 
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narrow, and the effective mass large. The overlap of wavefunctions centered 
on neighboring atoms is small for the inner or core electrons. The 4 f electrons 
of the rare earth metals, for example, overlap very little. 


Effective Masses in Semiconductors 


In many semiconductors it has been possible to determine by cyclotron 
resonance the effective masses of carriers in the conduction and valence bands 
near the band edges. The determination of the energy surface is equivalent to 
a determination of the effective mass tensor (29). Cyclotron resonance in a 
semiconductor is carried out with centimeter wave or millimeter wave radia- 
tion at low carrier concentration. 

The current carriers are accelerated in helical orbits about the axis of a 
static magnetic field. The angular rotation frequency a> c is 


(CGS) 


= 


eB 


(SI) u> c = 


eB 

m* 


(30) 


where m* is the appropriate cyclotron effective mass. Resonant absorption of 
energy from an rf electric field perpendicular to the static magnetic field 
(Fig. 12) occurs when the rf frequency is equal to the cyclotron frequency. 
Holes and electrons rotate in opposite senses in a magnetic field. 

We consider the experiment for m*/m — 0.1. At f c = 24 GHz, or cd c = 
1.5 X 10 11 s"\ we have B = 860 G at resonance. The line width is determined 
by the collision relaxation time r, and to obtain a distinctive resonance it is 
necessary that w c t ^ 1. The mean free path must be long enough to permit the 
average carrier to get one radian around a circle between collisions. The re- 
quirements are met with the use of higher frequency radiation and higher 
magnetic fields, with high purity crystals in liquid helium. 

In direct-gap semiconductors with band edges at the center of the Bril- 
louin zone, the bands have the structure shown in Fig. 13. The conduction 
band edge is spherical with the effective mass m 0 : 


e c = E g + h 2 k 2 /2m e , 


(31) 


Figure 12 Arrangement of fields in 
a cyclotron resonance experiment in 
a semiconductor. The sense of the 
circulation is opposite for electrons 
and holes. 
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Figure 13 Simplified view of the 
band edge structure of a direct-gap 
semiconductor. 


Table 2 Effective masses of electrons and holes in direct-gap semiconductors 


Crystal 

Electron 

mjm 

Heavy hole 
/m 

Light hole 
mg/m 

Split-off hole 

^SO 

Spin -orbit 
A, eV 

InSb 

0.015 

0.39 

0.021 

(0.11) 

0.82 

InAs 

0.026 

0.41 

0.025 

0.08 

0.43 

InP 

0.073 

0.4 

(0.078) 

(0.15) 

0.11 

GaSb 

0.047 

0.3 

0.06 

(0.14) 

0.80 

GaAs 

0.066 

0.5 

0,082 

0.17 

0.34 

Cu 2 0 

0.99 

— 

0.58 

0.69 

0.13 


referred to the valence hand edge. The valence bands are characteristically 
threefold near the edge, with the heavy hole hh and light hole Ih bands degen- 
erate at the center, and a band soh split off by the spin-orbit splitting A: 

ejjih ) = —k z k~/2m hh ; ejjh j = — ft 2 k 2 /2m ai ; 

ejjsoh) = -A - h 2 k 2 l2m si)k . 

Values of the mass parameters are given in Table 2. The forms (32) are only 
approximate, because even close to k = 0 the heavy and light hole bands are 
not spherical — see the discussion below for Ge and Si. 

The perturbation theory of band edges (Problem 9.8) suggests that the 
electron effective mass should be proportional to the band gap, approximately. 
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for a direct gap crystal. We use Tables 1 and 2 to find the fairly constant values 
mJ{mE g ) = 0.063, 0.060, and 0.051 in (eV) _1 for the series InSb, InAs, and 
InP, in agreement with this suggestion. 

Silicon and Germanium, 

The conduction and valence bands of germanium are shown in Fig. 14, 
based on a combination of theoretical and experimental results. The valence 
band edge in both Si and Ge is at k = 0 and is derived from p 3/2 and p 1/z states 
of the free atoms, as is clear from the tight-binding approximation (Chapter 9) 
to the wavefunctions. 

The p 3/2 level is fourfold degenerate as in the atom; the four states corre- 
spond to values ± 1 and ± |. The p L/2 level is doubly degenerate, with 
rrtj = ±\. The p 3;2 states are higher in energy' than the p 1/2 states; the energy 
difference A is a measure of the spin-orbit interaction. 

The valence band edges are not simple. Holes near the band edge are 
characterized by two effective masses, light and heavy. These arise from the 
two bands formed from the p 3/2 level of the atom. There is also a band formed 
from the p 1/2 level, split off from the p, i/2 level by the spin-orbit interaction. 
The energy surfaces are not spherical, but warped ( QTS , p. 271): 

e(k) = Ak 2 ± [B 2 k 4 + C 2 (k 2 k 2 + k*g + k;k 2 x )] m (33) 

The choice of sign distinguishes the two masses. The split-off band has 
e(k) = — A + Ak 2 . The experiments give, in units h 2 /2m. 

Si: A = -4.29 ; IBI = 0.68 ; ICI = 4.87 ; A = 0.044 eV 
Ge: A = -13.38 ; IB! = 8.48 ; ICI = 13.15 ; A = 0.29 eV 

Roughly, the light and heavy holes in germanium have masses 0.043 m and 
0.34 m ; in silicon 0.16 m and 0.52 m; in diamond 0.7 rn and 2.12 rn. 

The conduction band edges in Ge are at the equivalent points L of the 
Brillouin zone, Fig. 15a. Each band edge has a spheroidal energy' surface ori- 
ented along a (111) crystal axis, with a longitudinal mass m, = 1.59 m and a 
transverse mass m t = 0.082 m. For a static magnetic field at an angle 9 with 
the longitudinal axis of a spheroid, the effective cyclotron mass m c is 

1 _ cos 2 fl , sin 2 fl ( o 

ml ml m ‘ m l ( ' 

Results for Ge are shown in Fig. 16. 

In silicon the conduction band edges are spheroids oriented along the 
equivalent (100) directions in the Brillouin zone, with mass parameters 
m ; = 0.92 m and m t = 0.19 m, as in Fig. 17a. The band edges he along the lines 
labeled A in the zone of Fig. 15a, a little way in from the boundary points X. 

In GaAs we have A = -6.98, B = -4.5, ICI = 6.2, A = 0.341 eV. The 
band structure is shown in Fig. 17b. It has a direct band gap with an isotropic 
conduction electron mass of 0.067 rn. 
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Figure 14 Calculated band structure of germanium, after C. Y. Fong. The general features are in 
good agreement with experiment. The four valence bands are shown in gray. The fine structure of 
the valence band edge is caused by spin-orbit splitting. The energy gap is indirect; the conduction 
band edge is at the point (2ir/a)(| | §). The constant energy surfaces around this point are ellipsoidal. 
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k. 



(a) (b) 

Figure 15 Standard labels of the symmetry points and axes of the Brillouin zones of the fee and 
bcc lattices. The zone centers are J\ In (a) the boundary point at (27r/a)(100) is X; the boundary' 
point at (2 ir/n)(|^) is L; the line A runs between i' andX. In (b) the corresponding symbols are 
H,P, and A. 


Figure 16 Effective cyclotron mass of electrons in germa- 
nium at 4 K for magnetic field directions in a (110) plane. 
There are four independent mass spheroids in Ge, one 
along each [111] axis, but viewed in the (110) plane two 
spheroids always appear equivalent. (After Drcssclhaus, 
Kip, and Kittel.) 



|_|l I I I I I I I I I L—J 

-10 0 10 20 30 40 50 60 70 80 90 100 


Angle in degrees in (110) plane from [001] axis 
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(a) 

Figure 17a Constant energy ellipsoids for 
electrons in silicon, drawn for ra/m, = 5. 


INTRINSIC CARRIER CONCENTRATION 

We want the concentration of intrinsic carriers as a function of tempera- 
ture, in terms of the band gap. We do the calculation for simple parabolic band 
edges. We first calculate in terms of the chemical potential /x the number of 
electrons excited to the conduction band at temperature T. In semiconductor 
physics /x is called the Fermi level. At the temperatures of interest we may 
suppose for the conduction band of a semiconductor that e — /i > k B T, so that 
the Fermi-Dirac distribution function reduces to 

f e — exp[(/j.—e)/k B T] . (35) 

This is the probability that a conduction electron orbital is occupied, in an 
approximation valid when f e < 1. 

The energy of an electron in the conduction band is 

e h = E c + h 2 k 2 /2m e , (36) 

where E c is the energy at the conduction band edge, as in Fig. 18. Here 
m e is the effective mass of an electron. Thus from (6.20) the density of states 
at e is 



( 37 ) 
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The concentration of electrons in the conduction band is 


f" i / 2m \ m 

= D„{e)f„(e)de = — I -^1 e\p(p,/k R T) X 


j* (e~ E c ) m exp(-e/k B T)de , 


(38) 


which integrates to give 

( m k T \ 3/2 

exp[(n ~E c )/k B T] . 


(39) 


The problem is solved for n when p is known. It is useful to calculate the 
equilibrium concentration of holes p. The distribution function/;, for holes is 
related to the electron distribution function/, by/ = 1 — /, because a hole is 
the absence of an electron. We have 


fh = 1 - 


1 

exp[(e - p)/k B T] + 1 


- exp[(e - fi)/k s T\ , 


1 

exp[(;u. - e)/k B T] + 1 


(40) 


provided ip. — e) > k B T. 

If the holes near the top of the valence band behave as particles with 
effective mass m h , the density of hole states is given by 


D h (e) 



=? (E„~er 


(41) 


where E„ is the energy at the valence band edge. Proceeding as in (38) we obtain 
P = J D h (e)f h (e)de = 2 ex p[( £ c“ P-)/k B T] (42) 

for the concentration p of holes in the valence band. 

We multiply together the expressions for n and p to obtain the equilibrium 
relation, with the energy gap E„ = E c — E v as in Fig. 18, 

np = j (tn c m h ) m exp(-Eg/k s r) . (43) 

This useful result does not involve the Fermi level p. At 300 K the value of np 
is 2.10 X 10 19 cm -6 , 2.89 X 10 2fi cm -5 , and 6.55 X 10 12 cm -6 , for the actual 
band structures of Si, Ge, and GaAs, respectively. 

We have nowhere assumed in the derivation that the material is intrinsic: 
the result holds for impurity ionization as well. The only assumption made is 
that the distance of the Fermi level from the edge of both bands is large in 
comparison with k B T. 

A simple kinetic argument shows why the product np is constant at a given 
temperature. Suppose that the equilibrium population of electrons and holes 
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Figure 18 Energy scale for statistical calcula- 
tions. The Fermi distribution function is shown 
on the same scale, for a temperature k B T <8 E 
The Fermi level y is taken to lie well within the 
band gap, as for an intrinsic semiconductor. If 
e = y, then /= f . 


is maintained by black-body photon radiation at temperature T. The photons 
generate electron-hole pairs at a rate A(T), while B(T)np is the rate of the re- 
combination reaction e + h = photon. Then 

dn/dt = A(T) — B(T)np = dp/dt ■ (44) 

In equilibrium dn/dt = 0, dp/dt = 0, whence np = A(T)/B(T). 

Because the product of the electron and hole concentrations is a constant 
independent of impurity concentration at a given temperature, the introduction 
of a small proportion of a suitable impurity to increase n, say, must decrease p. 
This result is important in practice — we can reduce the total carrier concentra- 
tion n + p in an impure crystal, sometimes enormously, by the controlled intro- 
duction of suitable impurities. Such a reduction is called compensation. 

In an intrinsic semiconductor the number of electrons is equal to the 
number of holes, because the thermal excitation of an electron leaves behind a 
hole in the valence band. Thus from (43) we have, letting the subscript i de- 
note intrinsic and £ g = E c — E v , 

( 1 rp \ 3/2 

2^2 J ( r n e m h ) 3/4 exp(- Eg/ZkeT) . (45) 

The intrinsic carrier concentration depends exponentially on E^2k B T, 
where E g is the energy gap. We set (39) equal to (42) to obtain, for the Fermi 
level as measured from the top of the valence band, 

exp(2p/k B T) = {m,Jm e ) m exp(E g Jk B T) ; 
p, = \E f , + \k B T\n{m h /m e ) . 


(46) 

(47) 
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If nij, = m«, then ix = \E g and the Fermi level is in the middle of the forbid- 
den gap. 

Intrinsic Mobility 

The mobility is the magnitude of the drift velocity of a charge carrier per 
unit electric field: 

/a = \v\/E . (48) 

The mobility is defined to be positive for both electrons and holes, although 
their drift velocities are opposite in a given field. By writing /jl,. or fi h with 
subscripts tor the electron or hole mobility we can avoid any confusion be- 
tween fjb as the chemical potential and as the mobility. 

The electrical conductivity is the sum of the electron and hole contributions: 

a = ( nefi e + pe/x h ) , (49) 

where n and p are the concentrations of electrons and holes. In Chapter 6 the 
drift velocity of a charge q was found to be e = qrE/rn, whence 

/x„ = e.T,Jm c ; — er h /m }l , (50) 

where r is the collision time. 

The mobilities depend on temperature as a modest power law. The tem- 
perature dependence of the conductivity in the intrinsic region will be 
dominated by the exponential dependence exp( — E g /2k B T) of the carrier con- 
centration, Eq. (45). 

Table 3 gives experimental values of the mobility at room temperature. 
The mobility in SI units is expressed in mVV-s and is 1 0 4 of the mobility in 
practical units. For most substances the values quoted are limited by the scat- 
tering of carriers by thermal phonons. The hole mobilities typically are smaller 
than the electron mobilities because of the occurrence of band degeneracy at 
the valence band edge at the zone center, thereby making possible interband 
scattering processes that reduce the mobility considerably. 


Table 3 Carrier mobilities at room temperature, in cm 2 /V-s 


Crystal 

Electrons 

Holes 

Crystal 

Electrons 

Holes 

Diamond 

1800 

1200 

GaAs 

8000 

300 

Si 

1350 

480 

GaSb 

5000 

1000 

Ge 

3600 

1800 

PbS 

550 

600 

InSb 

800 

450 

PbSe 

1020 

930 

In As 

30000 

450 

PbTe 

2500 

1000 

InP 

4500 

100 

AgCl 

50 

— 

AlAs 

280 

— 

KBr (100 K) 

100 

— 

AlSb 

900 

400 

SiC 

100 

10-20 
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In some crystals, particularly in ionic crystals, the holes are essentially 
immobile and get about only by thermally-activated hopping from ion to ion. 
The principal cause of this “self-trapping” is the lattice distortion associated 
with the Jahn-Teller effect of degenerate states. The orbital degeneracy neces- 
sary for self-trapping is much more frequent for holes than for electrons. 

There is a tendency for crystals with small energy gaps at direct band edges 
to have high values of the electron mobility. Small gaps lead to small effective 
masses, which favor high mobilities. The highest mobility observed in a bulk 
semiconductor is 5 X 10 6 em 2 /V-s in PbTe at 4 K, where the gap is 0.19 eV. 


IMPURITY CONDUCTIVITY 

Certain impurities and imperfections drastically affect the electrical prop- 
erties of a semiconductor. The addition of boron to silicon in the proportion of 
1 boron atom to 10 5 silicon atoms increases the conductivity of pure silicon at 
room temperature by a factor of 10 3 . In a compound semiconductor a stoichio- 
metric deficiency of one constituent will act as an impurity; such semiconduc- 
tors are known as deficit semiconductors. The deliberate addition of impuri- 
ties to a semiconductor is called doping. 

We consider the effect of impurities in silicon and germanium. These ele- 
ments crystallize in the diamond structure. Each atom forms four covalent 
bonds, one with each of its nearest neighbors, corresponding to the chemical 
valence four. If an impurity atom of valence five, such as phosphorus, arsenic, 
or antimony, is substituted in the lattice in place of a normal atom, there will 
be one valence electron from the impurity atom left over after the four cova- 
lent bonds are established with the nearest neighbors, that is, after the impu- 
rity atom has been accommodated in the structure with as little disturbance as 
possible. Impurity atoms that can give up an electron are called donors. 

Donor States. The structure in Fig. 19 has a positive charge on the impurity 
atom (which has lost one electron). Lattice constant studies have verified that 
the pentavalent impurities enter the lattice by substitution for normal atoms, 
and not in interstitial positions. The crystal as a whole remains neutral because 
the electron remains in the crystal. 

The extra electron moves in the coulomb potential e/er of the impurity 
ion, where £ in a covalent crystal is the static dielectric constant of the 
medium. The factor 1/e takes account of the reduction in the coulomb force 
between charges caused by the electronic polarization of the medium. This 
treatment is valid for orbits large in comparison with the distance between 
atoms, and for slow motions of the electron such that the orbital frequency is 
low in comparison with the frequency oy corresponding to the energy gap. 
These conditions are satisfied quite well in Ce and Si by the donor electron of 
P, As, or Sb. 
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Figure 19 Charges associated with an arsenic impurity atom in silicon. Arsenic has five valence 
electrons, but silicon has only four valence electrons. Thus four electrons on the arsenic form tetra- 
hedral covalent bonds similar to silicon, and the fifth electron is available for conduction. The 
arsenic atom is called a donor because when ionized it donates an electron to the conduction band. 


We estimate the ionization energy of the donor impurity. The Bohr theory 
of the hydrogen atom may be modified to take into account the dielectric 
constant of the medium and the effective mass of an electron in the periodic 
potential of the crystal. The ionization energy of atomic hydrogen is — e 4 m/2h 2 
in CGS and — e 4 m/2(47re 0 ft) 2 in SI. 

In the semiconductor with dielectric constant e we replace e 2 by e 2 /e and 
m by the effective mass m e to obtain 


(CGS) 



( 13.6 m e 
\ e 2 m 


eV ; 


(SI) E d = 


e 4 m e 

2(4-7 Tee 0 h) 2 


(51) 


as the donor ionization energy of the semiconductor. 

The Bohr radius of the ground state of hydrogen is fp/me 2 in CGS or 
4-7re 0 ft z /me 2 in SI. Thus the Bohr radius of the donor is 


(CGS) a d = 

m e e 


mjm ) 


(SI) a d = 


4vee<fi 2 

m e p 2 


(52) 


The application of impurity state theory to germanium and silicon is com- 
plicated by the anisotropic effective mass of the conduction electrons. But the 
dielectric constant has the more important effect on the donor energy because 
it enters as the square, whereas the effective mass enters only as the first power. 

To obtain a general impression of the impurity levels we use m e = 0.1 m 
for electrons in germanium and m e ~ 0.2 m in silicon. The static dielectric 
constant is given in Table 4. The ionization energy of the free hydrogen atom is 
13.6 eV. For germanium the donor ionization energy E d on our model is 5 meV, 
reduced with respect to hydrogen by the factor mjme 2 = 4 X 10 -4 . The 
corresponding result for silicon is 20 meV. Calculations using the correct 
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Table 4 Static relative dielectric constant of semiconductors 


Crystal 

€ 

Crystal 

€ 

Diamond 

5.5 

GaSb 

15.69 

Si 

11.7 

GaAs 

13.13 

Gc 

15.8 

AlAs 

10.1 

InSb 

17.88 

AlSb 

10.3 

InAs 

14.55 

SiC 

10.2 

InP 

12.37 

Cu 2 0 

7.1 


Table 5 Donor ionization energies F.j of pentavalent 
impurities in germanium and silicon, in meV 


P 

As 

Sb 

Si 

45. 

49. 

39. 

Ge 

12.0 

12.7 

9.6 


anisotropic mass tensor predict 9.05 meV for germanium and 29.8 meV for 
silicon. Observed values of donor ionization energies in Si and Ge are given in 
Table 5. In GaAs donors have Ej — G meV. 

The radius of the first Bohr orbit is increased by em/rn e over the value 
0.53 A for the free hydrogen atom. The corresponding radius is (160)(0.53) =* 
80 A in germanium and (60)(0.53) — 30 A in silicon. These are large radii, so 
that donor orbits overlap at relatively low donor concentrations, compared to 
the number of host atoms. With appreciable orbit overlap, an “impurity' band” 
is formed from the donor states: see the discussion ol the metal-insulator tran- 
sition in Chapter 14. 

The semiconductor can conduct in the impurity band by electrons hop- 
ping from donor to donor. The process of impurity band conduction sets in at 
lower donor concentration levels if there are also some acceptor atoms pre- 
sent, so that some of the donors are always ionized. It is easier for a donor 
electron to hop to an ionized (unoccupied) donor than to an occupied donor 
atom, in order that two electrons wall not have to occupy the same site during 
charge transport. 

Acceptor States. A hole may be bound to a trivalent impurity in germanium 
or silicon (Fig. 20), just as an electron is bound to a pentavalent impurity. 
Trivalent impurities such as B, Al, Ga, and In are called acceptors because 
they accept electrons from the valence band in order to complete the covalent 
bonds with neighbor atoms, leaving holes in the band. 
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'Positive hole, as one electron was 
removed from a bond to complete the 
tetrahedral bonds of the boron atom 


p-type silicon 



Acceptor bound level 



Figure 20 Boron has only three valence electrons; it can complete its tetrahedral bonds only by 
taking an electron from a Si-Si bond, leaving behind a hole in the silicon valence band. The positive 
hole is then available for conduction. The boron atom is called an acceptor because when ionized 
it accepts an electron from the valence band. At 0 K the hole is bound. 


Table 6 Acceptor ionization energies E a of trivalent 
impurities in germanium and silicon, in meV 



B 

Al 

Ga 

In 

Si 

45. 

57. 

65. 

157. 

Ge 

10.4 

10.2 

10.8 

11.2 


When an acceptor is ionized a hole is freed, which requires an input 
of energy. On the usual energy band diagram, an electron rises when it gains 
energy, whereas a hole sinks in gaining energy. 

Experimental ionization energies of acceptors in germanium and silicon 
are given in Table 6. The Bohr model applies qualitatively for holes just as for 
electrons, but the degeneracy at the top of the valence band complicates the 
effective mass problem. 

The tables show that donor and acceptor ionization energies in Si are com- 
parable with k B T at room temperature (26 meV), so that the thermal ionization 
of donors and acceptors is important in the electrical conductivity of silicon at 
room temperature. If donor atoms are present in considerably greater num- 
bers than acceptors, the thermal ionization of donors will release electrons 
into the conduction band. The conductivity of the specimen then will be con- 
trolled by electrons (negative charges), and the material is said to be n type. 

If acceptors are dominant, holes will be released into the valence band 
and the conductivity will be controlled by holes (positive charges): the mater- 
ial is p type. The sign of the Hall voltage (6.53) is a rough test for n or p type. 
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Figure 21 Temperature dependence of die free carrier concentration in ultrapure Ge, after 
R. N, Hall. The net concentration of electrically active impurities is 2 X 10 10 cm -3 , as determined 
by Hall coefficient measurements. The rapid onset of intrinsic excitation as the temperature is in- 
creased is evident at low values of 1/T. The carrier concentration is closely constant between 20 K 
and 200 K. 

Another handy laboratory test is the sign of the thermoelectric potential, dis- 
cussed below. 

The numbers of holes and electrons are equal in the intrinsic regime. The 
intrinsic electron concentration n { at 300 K is 1.7 X 10 13 cm -3 in germanium 
and 4.6 X 10 !J cm -3 in silicon. The electrical resistivity of intrinsic material is 
43 ohm-cm for germanium and 2.6 X 10° ohm-cm for silicon. 

Germanium has 4.42 X 10 22 atoms per cm 3 . The purification of Ge lias 
been carried further than any other element. The concentration of the 
common electrically active impurities — the shallow donor and acceptor 
impurities — has been reduced below 1 impurity atom in 10 11 Ge atoms 
(Fig. 21). For example, the concentration of P in Ge can be reduced below 
4 X 10 10 cm -3 . There are impurities (H, O, Si, C) whose concentrations in Ge 
cannot usually be reduced below 10 12 — 10 14 cm -3 , but these do not affect elec- 
trical measurements and therefore may he hard to detect. 

Thermal Ionization of Donors and Acceptors 

The calculation of the equilibrium concentration of conduction electrons 
from ionized donors is identical with the standard calculation in statistical me- 
chanics of the thermal ionization of hydrogen atoms (TP, p. 369). If there are 
no acceptors present, the result in the low temperature limit k B T < E d is 

n = (noZV,() 1/2 exp( —E d /2k B T) , 


(53) 
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Figure 22 Electrical conductivity and hole concentration p calculated as a function of electron 
concentration n for a semiconductor at a temperature such that np = 10 2 ' 1 cm -6 . The conductivity 
is symmetrical ahout n = 10 ll> cm -3 . For n > 10 10 , the specimen is n type; for n < 10 10 , it is p type. 
We have taken fx,. = p.i„ for the mobilities. 


with rt 0 = 2(m ( k H T/2 irh 2 ! 3/2 ; here N d is the concentration of donors. To obtain 
(53) we apply the laws of chemical equilibria to the concentration ratio 
and then set [A^] = [e] = n. Identical results hold for acceptors, 
under the assumption of no donor atoms. 

If the donor and acceptor concentrations are comparable, affairs are com- 
plicated and the equations are solved by numerical methods. However, the law 
of mass action (43) requires the np product to be constant at a given tempera- 
ture. An excess of donors will increase the electron concentration and de- 
crease the hole concentration; the sum n + p will increase. The conductivity 
will increase as n + p if the mobilities are equal, as in Fig. 22. 

THERMOELECTRIC EFFECTS 

Consider a semiconductor maintained at a constant temperature while an 
electric field drives through it an electric current density If the current is 
carried only by electrons, the charge flux is 

jq = n(-e)(-p. e )E = nefi e E , (54) 

where p, e is the electron mobility. The average energy transported by an elec- 
tron is referred to the Fermi level fi, 

(E c — fi) + 2 kfjT , 
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where E n is the energy at the conduction band edge. We refer the energy to 
the Fermi level because different conductors in contact have the same Fermi 
level. The energy flux that accompanies the charge flux is 

ju - n(E r - /i. + i k s T){— fi e )E . (55) 

The Peltier coefficient II is defined by j v = Ilf.; or the energy carried 
per unit charge. For electrons, 

n e = -(E c - ^+lk B T)/e (56) 

and is negative because the energy flux is opposite to the charge flux. For 
holes 

jq^ PWhE ; ju = p(^ ~ E r + pc B T)fi h E , (57) 

where E, is the energy at the valence band edge. Thus 

II, = (j it - E v + lk n T)!e (58) 

and is positive. Equations (56) and (58) are the result of our simple drift veloc- 
ity theory; a treatment by the Boltzmann transport equation gives minor nu- 
merical differences. 2 

The absolute thermoelectric power Q is defined from the open circuit 
electric field created by a temperature gradient: 

E = Q grad T . (59) 

The Peltier coefficient II is related to the thermoelectric power <2 by 

n - QT . (60) 

This is the famous Kelvin relation of irreversible thermodynamics. A measure- 
ment of the sign of the voltage across a semiconductor specimen, one end of 
which is heated, is a rough and ready way to tell if the specimen is n type or p 
type (Fig. 23). 


SEMIMETALS 

In semimetals the conduction band edge is very 7 slightly lower in energy 
than the valence band edge. A small overlap in energy of the conduction and 
valence bands leads to small concentration of holes in the valence band and of 
electrons in the conduction band (Table 7). Three of the semimetals, arsenic, 
antimony, and bismuth, are in group V of the periodic table. 

Their atoms associate in pairs in the crystal lattice, with two ions and ten 
valence electrons per primitive cell. The even number of valence electrons 

' A simple discussion of Uoltzmann transport theory is given in Appendix 1 
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Figure 23 Peltier coefficient of 
n and p silicon as a function of 
temperature. Above 600 K the spec- 
imens act as intrinsic semiconduc- 
tors. The curves are calculated and 
tire points are experimental. (After 
T H. Gehalle and G. W. Hull.) 



Table 7 Electron and hole concentrations in semimetals 


Semimetal 

n„, in cm 3 

n h , in cm 3 

Arsenic 

(2.12 ± 0.01) X ]0 2 ° 

(2.12 ± 0.01) X 10 29 

Antimony 

(5.54 ± 0.05) X 10 19 

(5.49 ± 0.03) X 10 19 

Bismuth 

2.88 X 10 17 

3.00 X 10 17 

Graphite 

2.72 X 10 18 

2.04 X 10 1S 


could allow these elements to be insulators. Like semiconductors, the semi- 
metals may be doped with suitable impurities to vary the relative numbers of 
holes and electrons. Their concentrations may also be varied with pressure, for 
the band edge overlap varies with pressure. 


SUPERLATTICES 

Consider a multilayer crystal of alternating thin layers of different composi- 
tions. Coherent layers on a nanometer thickness scale may be deposited by 
molecular-beam epitaxy or metal-organic vapor deposition, thus building up a 
superperiodic structure on a large scale. Systems of alternate layers of GaAs and 
GaAlAs have been studied to 50 periods or more, with lattice spacing A of per- 
haps 5 nm (50 A). A superperiodic crystal potential arises from the superperiodic 
structure and acts on the conduction electrons and holes to create new (small) 
Brillouin zones and mini energy bands superposed on the band structures of the 
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constituent layers. Here we treat the motion of an electron in a superlattice in an 
applied electric field. 

Bloch Oscillator 

Consider a collisionless electron in a periodic lattice in one dimension, 
with motion normal to the planes of the superlattice. The equation of motion 
in a constant electric field parallel to k is hdk/dt = —eE or, for motion 
across a Brillouin zone with reciprocal lattice vector G — 2-irlA, we have 
hQ = H2 tt/A = eET, where T is the period of the motion. The Bloch 
frequency of the motion is w B = 2 tt/T = eEA/h. The electron accelerates from 
k = 0 toward the zone boundary; when it reaches k = tt/A it reappears (as by 
an Umklapp process) at the zone boundary at the identical point —tt/A, using 
the argument of Chapter 2. 

We consider the motion in a model system in real space. We suppose that 
the electron lies in a simple energy band of width e 0 : 

e = e 0 (l— cos kA) . (61) 

The velocity in k - space (momentum space) is 

v = h 'de/dk = {Ae^/h) sin kA , (62) 

and the position of the electron in real space, with the initial condition z = 0 
at t = 0, is given by 

z = fv dt — Jdk v(k)(dt/dk) — {Aejh) Jdk(—h/eE) sin kA 

~(—e 0 /eE)(coskA — L) = (—e 0 /eE)(cos(^eEAt/h)—l) . (63) 

This confirms that the Bloch oscillation frequency in real space is oj k — cEA/h. 
The motion in the periodic lattice is quite different from the motion in free 
space, for which the acceleration is constant. 

Zener Tunneling 

Thus far we have considered the effect of the electrostatic potential ~eEz 
(or — eEnA ) on one energy band; the potential tilts the whole band. Higher 
bands will also be tilted similarly, creating the possibility of crossing between 
ladder levels of different bands. The interaction between different band levels at 
the same energy opens the possibility for an electron in one band at n to cross to 
another band at This field-induced interband tunneling is an example of 
Zener breakdown, met most often at a single junction as in the Zener diode. 

SUMMARY 

• The motion of a wave packet centered at wavevector k is described by 
F = Hdk/dt , where F is the applied force. The motion in real space is ob- 
tained from the group velocity v g = fi~ ' V^eik). 



• The smaller the energy gap, the smaller is the effective mass im*i near the 

g a P- 

• A crystal with one hole has one empty electron state in an otherwise filled 
band. The properties of the hole are those of the IV — 1 electrons in this 
band. 

(a) If the electron is missing from the state of wavevector k , then the 
wavevector of the hole is k ( , = — k, . 

(b) The rate of change of k,, in an applied field requires the assignment of a 
positive charge to the hole: e h - e — ~e„. 

(c) If v e is the velocity an electron would have in the state k., then the veloc- 
ity to be ascribed to the hole of wavevector k* = - k ( is v h = v e . 

(d) The energy of the hole referred to zero for a filled band is positive and is 

e h (K) = — e(k„). 

(e) The effective mass of a hole is opposite to the effective mass of an elec- 
tron at the same point on the energy band: m h — — m e . 


Problems 

1. Impurity orbits. Indium antimonidc has £„ = 0.23 eV; dielectric constant e = 18; 
electron effective mass m e = 0.015 m. Calculate (a) the donor ionization energy; 
(b) the radius of the ground state orbit, (c) At what minimum donor concentration 
will appreciable overlap effects between the orbits of adjacent impurity atoms 
occur? This overlap tends to produce an impurity band — a baud of energy levels 
which permit conductivity presumably by a hopping mechanism in which electrons 
move from one impurity site to a neighboring ionized impurity site. 

2. Ionization of donors. In a particular semiconductor there are 10 13 donors/cm 3 
with an ionization energy of 1 meV and an effective mass 0.01 in. (a) Estimate the 
concentration of conduction electrons at 4 K. (b) What is the value of the Hall coeff- 
icent? Assume no acceptor atoms are present and that E g > k e T. 

3. Hall effect with two carrier types. Assuming concentration n, p; relaxation times 
r e , T;,; and masses m e , show that the Hall coefficient in the drift velocity 7 approxi- 
mation is 


(CGS) 


y p — nb 2 
“ ' (p + nbf 


where b = pjp-h is the mobility 7 ratio. In the derivation neglect terms of order B' 2 . In 
SI we drop the c. Hint: In the presence of a longitudinal electric field, find the 
transverse electric field such that the transverse current vanishes. The algebra may 
seem tedious, but the result is worth the trouble. Use (6.64), but for two carrier 
types; neglect (<a,.T) 2 in comparison with w,.T. 
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4. Cyclotron resonance for a spheroidal energy surface. Consider the energy 
surface 


e(k) = fi: 


k l +k l + l\ 

2 m t 2 rriij 


where m, is the transverse mass parameter and m ; is the longitudinal mass parame- 
ter. A surface on which e(k) is constant will be a spheroid. Use the equation of mo- 
tion (6), with V = h~ to show that = eB/(mim t ) m c when the static magnetic 
field B lies in the xy plane. This result agrees with (34) when 6 — tt/2. The result is 
in CGS; to obtain SI, omit the c. 

5. Magnetoresistance with two carrier types. Problem 6.9 shows that in the drift 
velocity approximation the motion of charge carriers in electric and magnetic fields 
does not lead to transverse magnetoresistance. The result is different with two car- 
rier types. Consider a conductor with a concentration n of electrons of effective 
mass m e and relaxation time r c ; and a concentration p of holes of effective 
mass m h and relaxation time T h . Treat the limit of very' strong magnetic fields, ai c t > 1. 
(a) Show in this limit that <r yr = (n — p)ec/B. (b) Show that the Hall field is given by, 
with Q = to c T, 


which vanishes if n — p. (c) Show that the effective conductivity in the x direction is 


_ ec 
Orff - B 




+ (n - pf 


Q* 


If n = p, cr or B ti n, i~ p, a saturates in strong fields; that is, it approaches a limit 
independent of B as B — > 00 . 
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Copper Aluminum 


Figure 1 Free electron Fermi surfaces for fee metals with one (Cu) and three (Al) valence elec 
trons per primitive cell. The Fermi surface shown for copper has been deformed from a sphen 
to agree with the experimental results. The second zone of aluminum is nearly half-filled witl 
electrons. (A. R. Mackintosh.) 
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CHAPTER 9: FERMI SURFACES AND METALS 


Few people would define a metal as “a solid 
with a Fermi surface.” This may nevertheless be 
the most meaningful definition of a metal one 
can give today; it represents a profound advance 
in the understanding of why metals behave as 
they do. The concept of the Fermi surface, as 
developed by quantum physics, provides a pre- 
cise explanation of the main physical properties 
of metals. 

A. R. Mackintosh 

The Fermi surface is the surface of constant energy e F in k space. The 
Fermi surface separates the unfilled orbitals from the filled orbitals, at 
absolute zero. The electrical properties of the metal are determined by the 
volume and shape of the Fermi surface, because the current is due to changes 
in the occupancy of states near the Fermi surface. 

The shape may be very intricate as viewed in the reduced zone scheme 
below and yet have a simple interpretation when reconstructed to lie near the 
surface of a sphere. We exhibit in Fig. 1 the free electron Fermi surfaces con- 
structed for two metals that have the face-centered cubic crystal structure: 
copper, with one valence electron, and aluminum, with three. The free elec- 
tron Fermi surfaces were developed from spheres of radius k F determined by 
the valence electron concentration. The surface for copper is deformed by in- 
teraction with the lattice. How do we construct these surfaces from a sphere? 
The constructions require the reduced and also the periodic zone schemes. 

Reduced Zone Scheme 

It is always possible to select the wavevector index k of any Bloch function 
to lie within the first Brillouin zone. The procedure is known as mapping the 
band in the reduced zone scheme. 

If we encounter a Bloch function written as <f k ir) = e ik r M k (r), with k' 
outside the first zone, as in Fig. 2, we may always find a suitable reciprocal lat- 
tice vector G such that k = k' + G lies within the first Brillouin zone. Then 

<M r) - £ ik r Wk'(r) = e ikt (e iG ' T u k ,(r)) 

= e" kr w k (r) = ip k (r) , 

where r/ k (r) = e -!C ' r w k .(r). Both e~ iG r and u k .(r) are periodic in the crystal lat- 
tice, so w k (r) is also, whence li/p'r) is of the Bloch form. 

Even with free electrons it is useful to work in the reduced zone scheme, 
as in Fig. 3. Any energy e k ' for k' outside the first zone is equal to an e k in the 
first zone, where k = k' + G. Thus we need solve for the energy only in the 
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Figure 2 First Brillouin zone of a square lattice of 
side a. The wavevector k' can be carried into the first 
zone by forming k' + G. The wavevector at a point A on 
the zone boundary is carried by G to the point A' on the 
opposite boundary of the same zone. Shall we count 
both A and A' as lying in the first zone? Because they 
can be connected by a reciprocal lattice vector, we 
count them as one identical point in the zone. 


Figure 3 Energy-wavevector relation fj. = h 2 k 2 /2m for 
free electrons as drawn in the reduced zone scheme. 
This construction often gives a useful idea of the over- 
all appearance of the band structure of a crystal. The 
branch AC if displaced by —2 rr/a gives the usual free 
electron curve for negative k, as suggested by the 
dashed curve. The branch A'C if displaced by 2ir la 
gives the usual curve for positive k. A crystal potential 
U(x) will introduce band gaps at the edges of the zone 
(as at A and A') and at the center of the zone (as at C). 
The point C when viewed in the extended zone 
scheme falls at the edges of the second zone. The 
overall width and gross features of the band structure 
are often indicated properly by such free electron 
bands in the reduced zone scheme. 




first Brillouin zone, for each band. An energy band is a single branch of the e k 
versus k surface. In the reduced zone scheme we may find different energies 
at the same value of the wavevector. Each different energy characterizes a dif- 
ferent band. Two bands are shown in Fig. 3. 

Two wavefunctions at the same k but of different energies will be inde- 
pendent of each other: the wavefunctions will be made up of different combi- 
nations of the plane wave components exp[i(k + G) ■ r] in the expansion of 
(7.29). Because the values of the coefficients C(k + G) will differ for the 
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different bands, we should add a symbol, say n, to the C’s to serve as a band 
index: C„(k + G). Thus the Bloch function for a state of wavevector k in the 
band n can be written as 

' k = exp(ik • r)u„ k (r) = ^ C„(k + G) exp[i(k + G) • r] . 

G 

Periodic Zone Scheme 

We can repeat a given Brillouin zone periodically through all of wavevec- 
tor space. To repeat a zone, we translate the zone by a reciprocal lattice vector. 
If we can translate a band from other zones into the first zone, we can translate 
a band in the first zone into every other zone. In this scheme the energy e k of a 
band is a periodic function in the reciprocal lattice: 

e k = e k+G • ( 2 ) 

Here e k+G is understood to refer to the same energy band as e k . 



(a) 


(b) 


(c) 


Figure 4 Three energy bands of a linear lattice plotted in (a) the extended (Brillouin), 
(b) reduced, and (c) periodic zone schemes. 
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The result of this construction is known as the periodic zone scheme. 
The periodic property of the energy also can be seen easily from the central 
equation (7.27). 

Consider for example an energy band of a simple cubic lattice as calcu- 
lated in the tight-binding approximation in (13) below: 

e k = —a — 2y ( cos k x a + cos k y a + cos k z a) , (3) 

where a and y are constants. One reciprocal lattice vector of the sc lattice is 
G = (27r/a)x; if we add this vector to k the only change in (3) is 

cos k/i — > cos iky + 2ir/a)a = cos (k x a + 2-7t) , 

but this is identically equal to cos k x a. The energy is unchanged when the 
wavevector is increased by a reciprocal lattice vector, so that the energy is a 
periodic function of the wavevector. 

Three different zone schemes are useful (Fig. 4): 

• The extended zone scheme in which different bands are drawn in differ- 
ent zones in wavevector space. 

• The reduced zone scheme in which all bands are drawn in the first 
Brillouin zone. 

• The periodic zone scheme in wliich every band is drawn in every zone. 


CONSTRUCTION OF FERMI SURFACES 

We consider in Fig. 5 the analysis for a square lattice. The equation of the 
zone boundaries is 2k-G + G 2 = 0 and is satisfied if k terminates on the plane 
normal to G at the midpoint of G. The first Brillouin zone of the square lattice 
is the area enclosed by the perpendicular bisectors of G! and of the three reci- 
procal lattice vectors equivalent by symmetry to C : in Fig. 5a. These four reci- 
procal lattice vectors are ±(2v/a)k x and ±(27r/<z)k r 

The second zone is constructed from G 2 and the three vectors equivalent 
to it by symmetry, and similarly for the third zone. The pieces of the second 
and third zones are drawn in Fig. 5b. 

To determine the boundaries of some zones we have to consider sets of 
several nonequivalent reciprocal lattice vectors. Thus the boundaries of sec- 
tion 3,, of the third zone are formed from the perpendicular bisectors of three 
G’s, namely (27 r/alk*; (477/(7)1^; and (27r/a)(k* + k^). 

The free electron Fermi surface for an arbitrary electron concentration is 
shown in Fig. 6. It is inconvenient to have sections of the Fermi surface that 
belong to the same zone appear detached from one another. The detachment 
can be repaired by a transformation to the reduced zone scheme. 

We take the triangle labeled 2 a and move it by a reciprocal lattice vector 
G = — (277/'a)k. such that the triangle reappears in the area of the first 
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Figure 5 (a) Construction in k space of the first three Brillouin zones of a square lattice. The 
three shortest forms of the reciprocal lattice vectors are indicated as Gj, G 2 , and G 3 . The lines 
drawn are the perpendicular bisectors of these G’s. (b) On constructing all lines equivalent by 
symmetry to the three lines in (a) we obtain the regions in k space which form the first three 
Brillouin zones. The numbers denote the zone to which the regions belong; the numbers here are 
ordered according to the length of the vector G involved in the construction of the outer boundary 
of the region. 



Figure 6 Brillouin zones of a square lattice in two 
dimensions. The circle shown is a surface of constant 
energy for free electrons; it will be the Fermi surface 
for some particular value of the electron concentra- 
tion. The total area of the filled region in k space de- 
pends only on the electron concentration and is inde- 
pendent of the interaction of the electrons with the 
lattice. The shape of the Fermi surface depends 
on the lattice interaction, and the shape will not be 
an exact circle in an actual lattice. The labels within 
the sections of the second and third zones refer to 
Fig. 7. 


0 


1st zone 




Figure 7 Mapping of the first, second, and third Brillouin zones in the reduced zone scheme. 
The sections of the second zone in Fig. 6 are put together into a square by translation through an 
appropriate reciprocal lattice vector. A different G is needed for each piece of a zone. 
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1st zone 



2nd zone 


3rd zone 


Figure 8 The free electron Fermi surface of Fig. 6, as viewed in the reduced zone scheme. The 
shaded areas represent occupied electron states. Parts of the Fermi surface fall in the second, 
third, and fourth zones. The fourth zone is not shown. The first zone is entirely occupied. 


Figure 9 The Fermi surface in the third zone as 
drawn in the periodic zone scheme. The figure was 
constructed by repeating the third zone of Fig. 8. 



Brillouin zone (Fig. 7). Other reciprocal lattice vectors will shift the triangles 
2 b , 2 C , 2 d to other parts of the first zone, completing the mapping of the second 
zone into the reduced zone scheme. The parts of the Fermi surface falling in 
the second zone are now connected, as shown in Fig. 8. 

A third zone is assembled into a square in Fig. 8, but the parts of the 
Fermi surface still appear disconnected. When we look at it in the periodic 
zone scheme (Fig. 9), the Fermi surface forms a lattice of rosettes. 

Nearly Free Electrons 

How do we go from Fermi surfaces for free electrons to Fermi surfaces 
for nearly free electrons? We can make approximate constructions freehand by 
the use of four facts: 

• The interaction of the electron with the periodic potential of the crystal 
creates energy gaps at the zone boundaries. 

• Almost always the Fermi surface will intersect zone boundaries perpendicu- 
larly. 
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2nd zone 



3rd zone 


Figure 10 Qualitative impression of the effect of a weak periodic crystal potential on the Fermi 
surface of Fig. 8. At one point on each Fermi surface we have shown the vector grad k e. In the sec- 
ond zone the energy increases toward the interior of the figure, and in the third zone the energy 
increases toward the exterior. The shaded regions are filled with electrons and are lower in energy 
than the unshaded regions. We shall see that a Fermi surface like that of the third zone is elec- 
tronlike, whereas one like that of the second zone is holelike. 



Figure 11 Harrison construction of free elec- 
tron Fermi surfaces on the second, third, and 
fourth zones for a square lattice. The Fermi 
surface encloses the entire first zone, which 
therefore is filled with electrons. 


• The crystal potential will round out sharp comers in the Fermi surfaces. 

• The total volume enclosed by the Fermi surface depends only on the 
electron concentration and is independent of the details of the lattice 
interaction. 

We cannot make quantitative statements without calculation, but qualitatively 
we expect the Fermi surfaces in the second and third zones of Fig. 8 to be 
changed as shown in Fig. 10. 

Freehand impressions of the Fermi surfaces derived from free electron 
surfaces are useful. Fermi surfaces for free electrons are constructed by a pro- 
cedure credited to Harrison, Fig. 11. The reciprocal lattice points are deter- 
mined, and a free electron sphere of radius appropriate to the electron 
concentration is drawn around each point. Any point in k space that lies within 
at least one sphere corresponds to an occupied state in the first zone. Points 
within at least two spheres correspond to occupied states in the second zone, 
and similarly for points in three or more spheres. 

We said earlier that the alkali metals are the simplest metals, with weak in- 
teractions between the conduction electrons and the lattice. Because the 
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alkalis have only one valence electron per atom, the first Brillouin zone bound- 
aries are distant from the approximately spherical Fermi surface that fills one- 
half of the volume of the zone. It is known by calculation and experiment that 
the Fermi surface of Na is closely spherical, and that the Fermi surface for Cs 
is deformed by perhaps 10 percent from a sphere. 

The divalent metals Be and Mg also have weak lattice interactions and 
nearly spherical Fermi surfaces. But because they have two valence electrons 
each, the Fermi surface encloses twice the volume in k space as for the alkalis. 
That is, the volume enclosed by the Fermi surface is exactly equal to that of a 
zone, but because the surface is spherical it extends out of the first zone and 
into the second zone. 

ELECTRON ORBITS, HOLE ORBITS, AND OPEN ORBITS 

We saw in Eq. (8.7) that electrons in a static magnetic field move on a 
curve of constant energy on a plane normal to B. An electron on the Fermi 
surface will move in a curve on the Fermi surface, because this is a surface of 
constant energy. Three types of orbits in a magnetic field are shown in Fig. 12. 

The closed orbits of (a) and (b) are traversed in opposite senses. Because 
particles of opposite charge circulate in a magnetic field in opposite senses, we 
say that one orbit is electronlike and the other orbit is holelike. Electrons in 
holelike orbits move in a magnetic field as if endowed with a positive charge. 
This is consistent with the treatment of holes in Chapter 8. 

In (c) the orbit is not closed: the particle on reaching the zone boundary 
at A is instantly folded back to B, where B is equivalent to B' because 


Hole orbit Electron orbit Open orbits 





Figure 12 Motion in a magnetic field of the wavevector of an electron on the Fermi surface, in 
(a) and (b) for Fermi surfaces topologically equivalent to those of Fig. 10. In (a) the wavevector 
moves around the orbit in a clockwise direction; in (b) the wavevector moves around the orbit in a 
counter-clockwise direction. The direction in (b) is what we expect for a free electron of charge 
—e; the smaller k values have the lower energy, so that the filled electron states lie inside the 
Fermi surface. We call the orbit in (b) electronlike. The sense of the motion in a magnetic field is 
opposite in (a) to that in (b), so that we refer to the orbit in (a) as holelike. A hole moves as a par- 
ticle of positive charge e. In (c) for a rectangular zone we show the motion on an open orbit in the 
periodic zone scheme. An open orbit is topologically intermediate between a hole orbit and an 
electron orbit. 
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Figure 13 (a) Vacant states at the corners of 
an almost-filled band, drawn in the reduced 
zone scheme, (b) In the periodic zone scheme 
the various parts of the Fermi surface are con- 
nected. Each circle forms a holelike orbit. The 
different circles are entirely equivalent to 
each other, and the density of states is that of a 
single circle. (The orbits need not be true cir- 
cles: for the lattice shown it is only required 
that the orbits have fourfold symmetry.) 


Figure 14 Vacant states near the top of an almost filled band in a two- 
dimensional crystal. This figure is equivalent to Fig. 12a. 



Figure 15 Constant energy surface in the Brillouin zone of a simple cubic lattice, for the assumed 
energy band e k = —a — 2y(cos k x a + cos k y a + cos k.a). (a) Constant energy surface 6 = —a. 
The filled volume contains one electron per primitive cell, (b) The same surface exhibited in the 
periodic zone scheme. The connectivity of the orbits is clearly shown. Can you find electron, hole, 
and open orbits for motion in a magnetic field Bz? (A. Sommerfeld and H. A. Bethe.) 


they are connected by a reciprocal lattice vector. Such an orbit is called an 
open orbit. Open orbits have an important effect on the magnetoresistance. 

Vacant orbitals near the top of an otherwise filled band give rise to hole- 
like orbits, as in Figs. 13 and 14. A view of a possible energy surface in three 
dimensions is given in Fig. 15. 
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Orbits that enclose filled states are electron orbits. Orbits that en- 
close empty states are hole orbits. Orbits that move from zone to zone 
without closing are open orbits. 


CALCULATION OF ENERGY BANDS 

Wigner and Seitz, who in 1933 performed the first serious band calcula- 
tions, refer to afternoons spent on the manual desk calculators of those days, 
using one afternoon for a trial wavefunetion. Here we limit ourselves to three 
introductory methods: the tight-binding method, useful for interpolation; the 
Wigner-Seitz method, useful for the visualization and understanding of the 
alkali metals; and the pseudopotential method, utilizing the general theory 
of Chapter 7, which shows the simplicity of many problems. 

Tight Binding Method for Energy Bands 

Let us start with neutral separated atoms and watch the changes in the 
atomic energy' levels as the charge distributions of adjacent atoms overlap 
when the atoms are brought together to form a crystal. Consider two hydrogen 
atoms, each with an electron in the Is ground state. The wavefunctions ifi A , tfi B 
on the separated atoms are shown in Fig. 16a. 

As the atoms are brought together, their wavefunctions overlap. We con- 
sider the two combinations >p A ± i (i B . Each combination shares an electron 
with the two protons, but an electron in the state ip A + ip B will have a some- 
what lower energy 7 than in the state i f/ A — t/» B . 

In t/» A + t/Oj the electron spends part of the time in the region midway 
between the two protons, and in this region it is in the attractive potential of 
both protons at once, thereby increasing the binding energy. In i fr A — i// B the 
probability density vanishes midway between the nuclei; an extra binding docs 
not appear. 



(a) 


(b) (c) 

Figure 16 (a) Schematic drawing of wavefunctions of electrons on two hydrogen atoms at large 
separation, (b) Ground state wavefunetion at closer separation, (c) Excited state wavefunetion. 
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Free 

atom 


Figure 17 The Is band of a ring of 20 
hydrogen atoms; the one-electron energies 
are calculated in the tight-binding approxi- 
mation with the nearest-neighbor overlap 
integral of Eq. (9), 

As two atoms arc brought together, two separated energy levels are 
formed for each level of the isolated atom. For N atoms, N orbitals are formed 
for each orbital of the isolated atom (Fig. 17). 

As free atoms are brought together, the coulomb interaction between the 
atom cores and the electron splits the energy levels, spreading them into 
bands. Each state of given, quantum number of the free atom is spread in the 
crystal into a band of energies. The width of the hand is proportional to the 
strength of the overlap interaction between neighboring atoms. 

There will also be bands formed from p, d, . . . states (l — 1, 2, . . .) of the 
free atoms. States degenerate in the free atom will form different bands. Each 
will not have the same energy as any other band over any substantial range of 
the wavevector. Bands may coincide in energy at certain values of k in the 
Brillouin zone. 

The approximation that starts out from the wavefunctions of the free atoms 
is known as the tight-binding approximation or the LCAO (linear combination 
of atomic orbitals) approximation. The approximation is quite good for the inner 
electrons of atoms, hut it is not often a good description of the conduction elec- 
trons themselves. It is used to describe approximately the d bands of the transi- 
tion metals and the valence bands of diamondlike and inert gas crystals. 

Suppose that the ground state of an electron moving in the potential 
U( r) of an isolated atom is <p(r), an s state. The treatment of bands arising from 
degenerate (p, d, . . .) atomic levels is more complicated. If the influence of 
one atom on another is small, we obtain an approximate wavefunction for one 
electron in the whole crystal by taking 

«Ak(r) = 2 C kj<P ( r - * j) , 

j 



0 ! i i i i i i i i i l 

0 1 2 3 4 5 

Nearest-neighbor distance, in Bohr radii 


( 4 ) 
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where the sum is over all lattice points. We assume the primitive basis contains 
one atom. This function is of the Bloch form (7.7) if C kl = N 1/2 e lk r , which 
gives, for a crystal of N atoms, 

Wr) = 2 exp(ik • rj)<p(r - rj) . (5) 

We prove (5) is of the Bloch form. Consider a translation T connecting 
two lattice points: 

*A k (r + T) = N~ ]/2 ^ exp(ik • i})<p(r + T - Tj) 
j 

= exp(ik • T) N~ m X exp[ik • (r } - T)]^[r - (q - T)] (5) 

j 

— expfik • T) </>k(r) , 
exactly the Bloch condition. 

We find the first-order energy by calculating the diagonal matrix elements 
of the hamiltonian of the crystal: 

<k|T/|k) = A 7_: X 2 expfik ■ {r, - r m )] , (7) 

j m 

where <p m = <p ( r - rj. Writing p,„ - r,„ - r,, 

<k \H |k) = 2 exp( -ik • pj J dV cp *( r - p m )H<p( r) . (8) 

m 

We now neglect all integrals in (8) except those on the same atom and 
those between nearest neighbors connected by p. We write 

/ dV (p*(r)Hsp(r) = —a ; / dV <p *( r — p)H<p(r) — — y ; (9) 

and we have the first-order energy, provided (klk) = 1: 

<k|/f|k> = -a - y 2 exp(-tk- p m ) = £ k ■ (10) 

m 

The dependence of the overlap energy y on the interatomic separation p 
can be evaluated explicitly for two hydrogen atoms in Is states. In rydberg 
energy units, Ry = meV2ti 2 , we have 

y( Ry) = 2(1 + p/a Q ) exp ( -p/a 0 ) , (11) 


where a 0 = h 2 /me 2 . The overlap energy decreases exponentially with the 
separation. 



9 Fermi Surfaces and Metals 


235 


For a simple cubic structure the nearest-neighbor atoms are at 


Pm = (± «A0) ; 

(0, ± a,0) ; (0,0, ± a) , 

(12) 

so that (10) becomes 



e k = — a — 2y(cos 

k x a + cos kyd + cos k z a) . 

(13) 


Thus the energies are confined to a band of width 12y. The weaker the over- 
lap, the narrower is the energy band. A constant energy surface is shown 
in Fig. 15. For ka < 1, e k — —a — 6y + y k 2 a 2 . The effective mass is m* = 
h 2 /2ya 2 . When the overlap integral y is small, the band is narrow and the effec- 
tive mass is high. 

We considered one orbital of each free atom and obtained one band e k . 
The number of orbitals in the band that corresponds to a nondegenerate 
atomic level is 2 N, for N atoms. We see this directly: values of k within the first 
Brillouin zone define independent wavefunctions. The simple cubic zone has 
— it /a < k x < 77/a, etc. The zone volume is 877 3 /a 3 . The number of orbitals 
(counting both spin orientations) per unit volume of k space is V/47T 3 , so that 
the number of orbitals is 2 'Via 2 '. Here V is the volume of the crystal, and 1/a 3 is 
the number of atoms per unit volume. Thus there are 2 N orbitals. 

For the fee structure with eight nearest neighbors, 

e k = -a - 8y cos \kjn cos \ k y a cos \ k z a . (14) 

For the fee structure with 12 nearest neighbors, 

e k = —a — 4y(cos \ k y a cos | k z a + cos § k,a cos | k x a + cos |k r a cos \ k y a) . (15) 

A constant energy surface is shown in Fig. 18. 



Figure 18 A constant energy surface of an fee crystal 
structure, in the nearest-neighbor tight-binding approx- 
imation. The surface shown has e = — a + 21 yl. 
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Wigner-Seitz Method 

Wigner and Seitz showed that for the alkali metals there is no inconsis- 
tency between the electron wavefunctions of free atoms and the nearly free 
electron model of the band structure of a crystal. Over most of a band the 
energy may depend on the wavevector nearly as for a free electron. However 
the Bloch wavefunction, unlike a plane wave, will pile up charge on the posi- 
tive ion cores as in the atomic wavefunction. 

A Bloch function satisfies the wave equation 

(2m p2 + ^ r ) j e tV ' r u^r) = e k e‘ k ' r u k (r) . (16) 

With p = —ih grad, we have 

p e ik r u k ( r) = ftk e ilrr « k (r) + <? itr pu k (r) ; 
p 2 e itr u k (r) = ihkf e ik ' r u^r) + e ikr (2ftk • p)u k (r) + e itr p 2 « k (r) ; 
thus the wave equation (16) may be written as an equation for u k : 

(p + ft k) 2 + j u k( r ) - e k «k(r) . (17) 

Atk = 0 we have </> 0 = u 0 ( r), where « c (r) has the periodicity of the lattice, sees 
the ion cores, and near them will look like the wavefunction of the free atom. 

It is much easier to find a solution at k = 0 than at a general k, because al 
k = 0 a nondegenerate solution will have the full symmetry of U( r), that is, o) 
the crystal. We can then use w 0 ( r ) to construct the approximate solution 

i// k = exp(ik • r)u 0 (r) . (18) 

This is of the Bloch form, but u 0 is not an exact solution of (1 7): it is a solution 
only if we drop the term in k-p. Often this term is treated as a perturbation, as 
in Problem 8. The k*p perturbation theory developed there is especially useful 
in finding the effective mass m* at a band edge. 

Because it takes account of the ion core potential the function (18) is a 
much better approximation than a plane wave to the correct wavefunction 
The energy of the approximate solution depends on k as ( hk) 2 /2im, , exactly as 
for the plane wave, even though the modulation represented by u 0 (r) may be 
very strong. Because u 0 is a solution of 

\2m p2 + ^*7 u ^ r ' = e o M o(r) , (19) 

the function (18) has the energy expectation value e 0 + (h 2 k 2 /2m). The 
function w 0 (r) often will give us a good picture of the charge distribution 
within a cell. 
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Metal, k - 0 
Free atom 

Metal, k at Brillouin 
zone boundary 


0 12 3 4 

r (Bohr units) 

Figure 19 Radial wavefunctions for the 3s orbital of free sodium atom and for the 3s conduction 
band in sodium metal The wavefunctions, which are not normalized here, are found by integrat- 
ing the Schrodinger equation for an electron in the potential well of an Na* ion core. For the free 
atom the wavefunction is integrated subject to the usual Schrodinger boundary condition </*(?") --> 0 
as r -¥ »; the energy eigenvalue is -5.15 eV. The wavefunction for wavevector k = 0 in the metal 
is subject to the Wigner-Seitz boundary condition that dif/idr = 0 when r is midway between 
neighboring atoms; the energy' of this orbital is —8.2 eV, considerably lower than for the free 
atom. The orbitals at the zone boundary are not filled in sodium; their energy is +2.7 eV. (After 
E. Wigner and F. Seitz.) 



Wigner and Seitz developed a simple and fairly accurate method of calcu- 
lating « 0 (r). Figure 19 shows the Wigner-Seitz wavefunction for k = 0 in the 
3s conduction band of metallic sodium. The function is practically constant 
over 0.9 of the atomic volume. To the extent that the solutions for higher k 
may be approximated by exp(ik • r)u 0 (r), the wavefunctions in the conduction 
band will he similar to plane waves over most of the atomic volume, but in- 
crease markedly and oscillate within the ion core. 

Cohesive Energy. The stability of the simple metals with respect to free 
atoms is caused by the lowering of the energy of the Bloch orbital with k = 0 
in the crystal compared to the ground valence orbital of the free atom. The 
effect is illustrated in Fig. 19 for sodium and in Fig. 20 for a linear periodic 
potential of attractive square wells. The ground orbital energy is much lower 
(because of lower kinetic energy) at the actual spacing in the metal than for 
isolated atoms. 

A decrease in ground orbital energy will increase the binding. The decrease 
in ground orbital energy is a consequence of the change in the boundary condi- 
tion on the wavefunction: The Schrodinger boundary condition for the free 
atom is ip(r) -+ 0 as r — > In the crystal the k = 0 wavefunction « 0 (r) has the 
symmetry of the lattice and is symmetric about r = 0. To have this, the normal 
derivative of i/r must vanish across every plane midway between adjacent atoms. 
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Figure 20 Ground orbital (k = 0) energy for an electron in a periodic square well potential of 
depth | U 0 1 = 2h~/nur. The energy is lowered as the wells come closer together. Here a is held con- 
stant and b is varied. Large b/a corresponds to separated atoms. (Courtesy of C. Y. Fong.) 


Figure 21 Cohesive energy of sodium metal is the dif- 
ference between the average energy of an electron in the 
metal ( — 6.3 eV) and the ground state energy (—5.15 eV) 
of the valence 3s electron in the free atom, referred to an 
Na + ion plus free electron at infinite separation. 


-5.15 eV 


Atom 


Ground 


state 




Metal 


Fermi level 


-6.3 eV 

Average energy 


-8.2 eV 

k = 0 state 


In a spherical approximation to the shape of the smallest Wigner-Seitz cell 
we use the Wigner-Seitz boundary condition 

{difi/dr ) ro = 0 , (20) 

where r 0 is the radius of a sphere equal in volume to a primitive cell of the lat- 
tice. In sodium, r 0 = 3.95 Bohr units, or 2.08 A; the half distance to a nearest 
neighbor is 1.86 A. The spherical approximation is not bad for fee and bcc 
structures. The boundary condition allows the ground orbital wavefunction to 
have much less curvature than the free atom boundary condition. Much less 
curvature means much less kinetic energy. 

In sodium the other filled orbitals in the conduction band can be repre- 
sented in a rough approximation by wavefunctions of the form (18), with 

A = e u 0 (r) ■ e k = e 0 + . 
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The Fermi energy is 3.1 eV. from Table 6.1. The average kinetic energy per 
electron is 0.6 of the Fermi energy, or 1.9 eV. Because e 0 = —8.2 eV at k = 0, 
the average electron energy is {e ; .} — —8.2 + 1.9 = —6.3 eV, compared with 
—5.15 eV for the valence electron of the free atom, Fig. 21. 

We therefore estimate that sodium metal is stable by about 1.1 eV with 
respect to the free atom. This result agrees well with the experimental value 
1.13 eV. 

Pseudopotential Methods 

Conduction electron wavefunctions are usually smoothly varying in the re- 
gion between the ion cores, but have a complicated nodal structure in the re- 
gion of the cores. This behavior is illustrated by the ground orbital of sodium, 
Fig. 19. It is helpful to view the nodes in the conduction electron wavefunction 
in the core region as created by the requirement that the function be ortho- 
gonal to the wavefunctions of the core electrons. This all comes out of the 
Schrodinger equation, but we can see that wc need the flexibility of two nodes 
in the 3s conduction orbital of Na in order to be orthogonal both to the Is core 
orbital with no nodes and the 2s core orbital with one node. 

Outside the core the potential energy that acts on the conduction electron 
is relatively weak: the potential energy is only the coulomb potential of the 
singly-charged positive ion cores and is reduced markedly by the electrostatic 
screening of the other conduction electrons. Chapter 14. In this outer region 
the conduction electron wavefunctions are as smoothly varying as plane waves. 

If the conduction orbitals in this outer region are approximately plane 
waves, the energy must depend on the wavevector approximately as 
e k = fi 2 k 2 /2m as for free electrons. But how do we treat the conduction orbitals 
in the core region where the orbitals are not at all like plane waves? 

What goes on in the core is largely irrelevant to the dependence of e on k. 
Recall that we can calculate the energy by applying the hamiltonian operator 
to ail orbital at any point in space. Applied in the outer region, this operation 
will give an energy' nearly equal to the free electron energy. 

This argument leads naturally to the idea that we might replace the actual 
potential energy 7 (and filled shells) in the core region by an effective potential 
energy 1 that gives the same wavefunctions outside the core as are given 
by the actual ion cores. It is startling to find that the effective potential or 


J. C. Phillips and L. Kleinman, Phys. Ecv. 116, 287 (1959); E. Antoncik, J. Phys. Chern. 
Solids 10. 314 (1959). The general theory of pseudopotentials is discussed by B. J. Austin, 
V. Heine, and L. J. Sham, Phys. Rev. 127, 276 (1962); see also Vol. 24 of Solid state physics. The 
utility of the empty core model has been known for many years; it goes back to E. Fermi, Nuovo 
Cimento 2, 157 (19.34); H, Hellmaun, Acta Physiochimica URSS 1, 913 (1935); and H. Hellmann 
and W. Kassatotschkin, J. Chem. Phys. 4, 324 (1936), who wrote “Since the field of the ion 
determined in this way runs a rather flat course, it is sufficient in the first approximation to set the 
valence electron in the lattice equal to a plane wave." 
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pseudopotential that satisfies this requirement is nearly zero. This conclusion 
about pseudopotentials is supported by a large amount of empirical experience 
as well as by theoretical arguments. The result is referred to as the cancella- 
tion theorem. 

The pseudopotential for a problem is not unique nor exact, but it may 
be very good. On the Empty Core Model (ECM) we can even take the un- 
screened pseudopotential to be zero inside some radius R e : 

U(r) - |° g ’ r° r r ^5 ; (21) 

y-er/r , forr > R e . 

This potential should now be screened as described in Chapter 10. Each com- 
ponent l/(K) of U(r) is to be divided by the dielectric constant e(K) of the 
electron gas. If, just as an example, we use the Thomas-Fermi dielectric func- 
tion (14.33), we obtain the screened pseudopotential plotted in Fig. 22a. 



Figure 22a Pseudopotential for metallic sodium, based on the empty core model and screened 
by the Thomas-Fermi dielectric function. The calculations were made for an empty core radius 
R, = 1.66a 0 , where a 0 is the Bohr radius, and for a screening parameter k^a 0 = 0.79. The dashed 
curve shows the assumed unscreened potential, as from (21). The dotted curve is the actual 
potential of the ion core; other values of U(r) are —50.4, —11.6, and -4.6, for r = 0.15, 0.4, and 
0.7, respectively. Thus the actual potential ol the ion (chosen to lit the energy levels of the free 
atom) is very much larger than the pseudopotential, over 200 times larger at r = 0.15. 
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Figure 22b A typical reciprocal space 
pseudopotential. Values of C7(k) for wavevec- 
tors equal to the reciprocal lattice vectors, G, 
arc indicated by the dots. For very small k the 
potential approaches (—2/3) times the Fermi 
energy, which is the screened-ion limit for 
metals. (After M. L. Cohen.) 


The pseudopotential as drawn is much weaker than the true potential, but 
the pseudopotential was adjusted so that the wavefunetion in the outer region 
is nearly identical to that for the true potential. In the language of scattering 
theory, we adjust the phase shifts of the pseudopotential to match those of the 
true potential. 

Calculation of the band structure depends only on the Fourier compo- 
nents of the pseudopotential at the reciprocal lattice vectors. Usually only a 
few values of the coefficients U( G) are needed to get a good band structure: 
see the [7(G) in Fig. 22b. These coefficients arc sometimes calculated from 
model potentials, and sometimes they are obtained from fits of tentative band 
structures to the results of optical measurements. Good values of [7(0) can be 
estimated from first principles; it is shown in (14.43) that for a screened 
coulomb potential (7(0) = — §e F . 

In the remarkably successful Empirical Pseudopotential Method (EPM) 
the band structure is calculated using a few coefficients [7(G) deduced from 
theoretical fits to measurements of the optical reflectance and absorption of 
crystals, as discussed in Chapter 15. Charge density maps can be plotted from 
the wavefunetions generated by the EPM — see Fig. 3.11. The results are in 
excellent agreement with x-ray diffraction determinations; such maps give an 
understanding of the bonding and have great predictive value for proposed 
new structures and compounds. 

The EPM values of the coefficients [7(G) often are additive in the contri- 
butions of the several types of ions that are present. Thus it may be possible to 
construct the [7(G) for entirely new structures, starting from results on known 
structures. Further, the pressure dependence of a band structure may be de- 
termined when it is possible to estimate from the form of the U(r) curve the 
dependence of [7(G) on small variations of G. 
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It is often possible to calculate band structures, cohesive energy, lattice 
constants, and bulk moduli from first principles. Tn such ab initio pseudo- 
potential calculations the basic inputs are the crystal structure type and the 
atomic number, along with well-tested theoretical approximations to exchange 
energy terms. This is not the same as calculating from atomic number alone, 
but it is the most reasonable basis for a first-principles calculation. The results 
of Yin and Cohen are compared with experiment in the table that follows. 



Lattice 
constant (A) 

Cohesive 
energy (eV) 

Bulk modulus 
(Mbar) 

Silicon 

Calculated 

5.45 

4.84 

0.98 

Experimental 

5.43 

4.63 

0.99 

Germanium 

Calculated 

5.66 

4.26 

0.73 

Experimental 

5.65 

3.85 

0.77 

Diamond 

Calculated 

3.60 

8.10 

4.33 

Experimental 

3.57 

7.35 

4.43 


EXPERIMENTAL METHODS IN FERMI SURFACE STUDIES 

Powerful experimental methods have been developed for the determina- 
tion of Fermi surfaces. The methods include magnetoresistance, anomalous 
skin effect, cyclotron resonance, magneto -acoustic geometric effects, the 
Shubnikow-de Haas effect, and the de Haas-van Alphen effect. Further infor- 
mation on the momentum distribution is given by positron annihilation, 
Compton scattering, and the Kohn effect. 

We propose to study one method rather thoroughly. All the methods are 
useful, but need detailed theoretical analysis. We select the de Haas-van 
Alphen effect because it exhibits very well the characteristic periodicity in l/D 
of the properties of a metal in a uniform magnetic field. 

Quantization of Orbitx in a Magnetic Field 

The momentum p of a particle in a magnetic field is the sum (Appendix G) 
of two parts, the kinetic momentum p kill = mv = fik and the potential momen- 
tum or field momentum p fle | d = qAJc, where q is the charge. The vector poten- 
tial is related to the magnetic field by B = curl A. The total momentum is 

(CGS) P = Plan + Pfield = + qA/C . 

In SI the factor c ~ 1 is omitted. 


( 22 ) 
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Following the semiclassical approach of Onsager and Lifshitz, we assume 
that the orbits in a magnetic field are quantized by the Bohr-Sommerfeld relation 

p • dr — (n + y)2rrh , (23) 

when n is an integer and y is a phase correction that for free electrons has the 
value Then 

<j> p • dr = • dr + ^ ■ dr . (24) 

The equation of motion of a particle of charge q in a magnetic field is 

= . (25a) 

at c at 

We integrate with respect to time to give 

ftk = |r X B , 

apart from an additive constant which does not contribute to the final result. 
Thus one of the path integrals in (24) is 

fik • dr = ^ r X B • dr — — ■ |r X dr - — <t> , (25b) 

where 4> is the magnetic flux contained within the orbit in real space. We have 
used the geometrical result that 

j> r X dr = 2X (area enclosed by the orbit) . 

The other path integral in (24) is 

|r A • dr = ^ J curl A ■ da = ^ J B ■ da — ^ , (25c) 

by the Stokes theorem. Here dtr is the area element in real space. The momen- 
tum path integral is the sum of (25b) and (25c): 

<j> p • dr — — = (n + y)2Trh . (26) 

It follows that the orbit of an electron is quantized in such a way that the 
flux through it is 

= (n + y)(2i rhe/e) . (27) 

The flux unit 27 rhe/e — 4.14 X 10” 7 gauss cm 2 or T m 2 . 

In the de Haas-van Alphen effect discussed below we need the area of the 
orbit in wavevector space. We obtained in (27) the flux through the orbit in 
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real space. By (25a) we know that a line element A r in the plane normal to B is 
related to A k by Ar = ( hc/eB)Ak , so that the area S n in k space is related to the 
area A n of the orbit in r space by 

A,, = (hc/eB)% . (28) 

It follows that 


from (27), whence the area of an orbit in k space wall satisfy 

S„ = (n + y)^B . (30) 

In Fermi surface experiments we may be interested in the increment A B 
for which two successive orbits, n and n + 1, have the same area in k space on 
the Fermi surface. The areas are equal when 


1 1_\ _ 2ve 

E n+l E n J fie 


(31) 


from (30). We have the important result that equal increments of 1/B repro- 
duce similar orbits — this periodicity in 1/B is a striking feature of the magneto- 
oscillatory effects in metals at low temperatures: resistivity, susceptibility, heat 
capacity. 

The population of orbits on or near the Fermi surface oscillates as B is var- 
ied, causing a wide variety of effects. From the period of the oscillation we 
reconstruct the Fermi surface. The result (30) is independent of the gauge of 
the vector potential used in the expression (22) for momentum; that is, p is not 
gauge invariant, but S n is. Gauge invariance is discussed further in Chapter 10 
and in Appendix G. 


De Ham-van Alphen Effect 

The de Haas-van Alphen effect is the oscillation of the magnetic moment 
of a metal as a function of the static magnetic field intensity. The effect can be 
observed in pure specimens at low temperatures in strong magnetic fields: we 
do not want the quantization of the electron orbits to be blurred by collisions, 
and we do not want the population oscillations to be averaged out by thermal 
population of adjacent orbits. 

The analysis of the dHvA effect is given for absolute zero in Fig. 23. The 
electron spin is neglected. The treatment is given for a two-dimensional (2D) 
system; in 3D we need only multiply the 2D wavefunction by plane wave factors 
exp (ik z z), where the magnetic field is parallel to the z axis. The area of an orbit in 
k„ k y space is quantized as in (30). The area between successive orbits is 

AS = S„ — S n ~i = 2t reB/hc . (32) 
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(a) (b) (c) (d) (e) 


Figure 23 Explanation of the de Haas-van Alphen effect for a free electron gas in two dimen- 
sions in a magnetic field. The filled orbitals of the Fermi sea in the absence of a magnetic field are 
shaded in a and d. The energy levels in a magnetic field are shown in b, c, and e. In b the field has 
a value B , such that the total energy of the electrons is the same as in the absence of a magnetic 
field: as many electrons have their energy raised as lowered by the orbital quantization in the mag- 
netic field Bj. When we increase the field to B 2 the total electron energy is increased, because the 
uppermost electrons have their energy raised. In e for field B 3 the energy is again equal to that for 

the field B = 0. The total energy is a minimum at points such as B lt B 3 , B 5 and a maximum 

near points such as B 2 , B 4) .... 

The area in k space occupied by a single orbital is (27 r/L) 2 , neglecting spin, 
for a square specimen of side L. Using (32) we find that the number of free 
electron orbitals that coalesce in a single magnetic level is 

D = (27reB/hc)(L/2iT) z = pB , (33) 

where p = eL 2 /2irhc, as in Fig. 24. Such a magnetic level is called a Landau 
level. 

The dependence of the Fermi level on B is dramatic. For a system of N 
electrons at absolute zero the Landau levels are entirely filled up to a magnetic 
quantum number we identify by s, where s is a positive integer. Orbitals at the 
next higher level s + 1 will be partly filled to the extent needed to accommo- 
date the electrons. The Fermi level will lie in the Landau level s + 1 if there 
are electrons in this level; as the magnetic field is increased the electrons move 
to lower levels. When s + 1 is vacated, the Fermi level moves down abruptly 
to the next lower level s. 

The electron transfer to lower Landau levels can occur because their 
degeneracy D increases as B is increased, as shown in Fig. 25. As B is 
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(a) (b) 

Figure 24 (a) Allowed electron orbitals in two dimensions in absence of a magnetic field, (b) In a 
magnetic field the points which represent the orbitals of free electrons may be viewed as re- 
stricted to circles in the former k,k tJ plane. The successive circles correspond to successive values 
of the quantum number n in the energy (n — § )ka> c . The area between successive circles is 

(CGS) ir\{k 2 ) = 2-irk(Ak ) = ( Zirni/h 2 ) Ae = 27 vnuajfi — 2ireB/hc . 

The angular position of the points has no significance. The number of orbitals on a circle is con- 
stant and is equal to the area between successive circles times the number of orbitals per unit area 
in (a), or ( 2TreBlfic)(LI2'iT ) 2 = L 2 eB/2irhc, neglecting electron spin. 



Figure 25 (a) The heavy line gives the number of particles in levels which are completely occu- 
pied in a magnetic field B , for a two-dimensional system with N = 50 and p = 0.50. The shaded 
area gives the number of particles in levels partially occupied. The value of s denotes the quantum 
number of the highest level which is completely filled. Thus at B = 40 we have s = 2; the levels 
n = 1 and n = 2 are filled and there are 10 particles in the level n = 3. At B = 50 the level n = 3 is 
empty, (b) The periodicity in 1 IB is evident when the same points are plotted against 1/B. 


increased there occur values of B at which the quantum number of the upper- 
most filled level decreases abruptly by unity. At the critical magnetic fields 
labeled B s no level is partly occupied at absolute zero, so that 

spB s = N . 


( 34 ) 
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Figure 26 The upper curve is the total electronic energy versus 1 IB. The oscillations in the en- 
ergy U may be detected by measurement of the magnetic moment, given by — dU/dB . The thermal 
and transport properties of the metal also oscillate as successive orbital levels cut through the 
Fermi level when the field is increased. The shaded region in the figure gives the contribution to 
the energy from levels that are only partly filled. The parameters for the figure are the same as for 
Fig. 25, and we have taken the units of B such that B = hto c . 


The number of filled levels times the degeneracy at B s must equal the number 
of electrons N. 

To show the periodicity of the energy as B is varied, we use the result 
that the energy of the Landau level of magnetic quantum number n is 
E n = (n — l)hix) c , where u> c = eB/m*c is the cyclotron frequency. The result for 
E n follows from the analogy between the cyclotron resonance orbits and the 
simple harmonic oscillator, but now we have found it convenient to start 
counting at n = 1 instead of at n = 0. 

The total energy of the electrons in levels that are fully occupied is 

^ Dh(o c {n — |) = | Dfiu ipS 2 , (35) 

n = l 

where D is the number of electrons in each level. The total energy of the 
electrons in the partly occupied level s + 1 is 

Eco c (s + l)(N - sD) , (36) 


where sD is the number of electrons in the lower filled levels. The total energy 
of the N electrons is the sum of (35) and (36), as in Fig. 26. 

The magnetic moment g of a system at absolute zero is given by p. = 
— dU/dB. The moment here is an oscillatory function of 1 IB, Fig. 27. This os- 
cillatory magnetic moment of the Fermi gas at low temperatures is the de 
Haas-van Alphen effect. From (31) we see that the oscillations occur at equal 
intervals of 1 IB such that 


B 


2ire 

ftcS 


( 37 ) 
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Figure 27 At absolute zero the magnetic moment is given by —dUdB. The energy plotted in 
Fig. 26 leads to the magnetic moment shown here, an oscillatoiy function of 1/B. In impure speci- 
mens the oscillations are smudged out in part because the energy levels are no longer sharply defined. 


Figure 28 The orbits in the section AA' are ex- 
tremal orbits: the cyclotron period is roughly con- 
stant over a reasonable section of the Fermi surface. 
Other sections such as BB ' have orbits that vary in 
period along the section. 



where S is the extremal area (see below) of the Fermi surface normal to the di- 
rection of B. From measurements of A(l/B), we deduce the corresponding ex- 
tremal areas S; thereby much can be inferred about the shape and size of the 
Fermi surface. 

Extremal Orbits. One point in the interpretation of the dHvA effect is sub- 
tle. For a Fermi surface of general shape the sections at different values of A: B 
will have different periods. Here k B is the component of k along the direction 
of the magnetic field. The response will be the sum of contributions from all 
sections or all orbits. But the dominant response of the system comes from or- 
bits whose periods are stationary with respect to small changes in k B . Such 
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orbits are called extremal orbits. Thus, in Fig. 28 the section AA’ dominates 
the observed cyclotron period. 

The argument can be put in mathematical form, but we do not give the 
proof here (QTS y p. 223). Essentially it is a question of phase cancellation; the 
contributions of different nonextremal orbits cancel, but near the extrema 
the phase varies only slowly and there is a net signal from these orbits. Sharp 
resonances are obtained even from complicated Fermi surfaces because the 
experiment selects the extermal orbits. 

Fermi Surface of Copper. The Fermi surface of copper (Fig. 29) is distinctly 
nonspherical: eight necks make contact with the hexagonal faces of the first 
Brillouin zone of the fee lattice. The electron concentration in a monovalent 
metal with an fee structure is n = 4/a 3 ; there are four electrons in a cube of 
volume a 3 . The radius of a free electron Fermi sphere is 

k F = (37 r^n) 173 = (127 T 2 ’/a 3 ) u ’ i = (4.90 /a) , (38) 


and the diameter is 9.80/a. 

The shortest distance across the Brillouin zone (the distance between 
hexagonal faces) is (27r/a)(3) 1/2 = 10.88/a, somewhat larger than the diameter 
of the free electron sphere. The sphere does not touch the zone boundary, but 
we know that the presence of a zone boundary tends to lower the band energy 
near the boundary. Thus it is plausible that the Fermi surface should neck out 
to meet the closest (hexagonal) faces of the zone (Figs. 18 and 29). 

The square faces of the zone are more distant, with separation 12.57/a, 
and the Fermi surface does not neck out to meet these faces. 


EXAMPLE: Fermi Surface of Gold. In gold for quite a wide range of field directions 
Shoenberg finds the magnetic moment has a period of 2 X 10“ ' gauss" 1 . This period 
corresponds to an extremal orbit of area 

= a 9.55 X iff _ ]0 ,« cni -2 

A(1 /B) 2X10" 9 

From Table 6.1, we have k F = 1.2 X 10 s cm" 1 for a free electron Fermi sphere for 
gold, or an extremal area of 4.5 X lO 10 cm -2 , in general agreement with the experi- 
mental value. The actual periods reported by Shoenberg are 2.05 X 10 9 gauss" 1 and 
1.95 X 10 -9 gauss -1 . In the [111] direction in Au a large period of 6 X 10“ 8 gauss" 1 is 
also found; the corresponding orbital area is 1.6 X 10 15 cm" 3 . This is the “neck” orbit 

N. Another extremal orbit, the “dog's bone,” is shown in Fig. 30; its area in Au is about 

O. 4 of the belly area. Experimental results are shown in Fig. 31. To do the example in 
SI, drop a from the relation for S and use as the period 2 X 10 5 tesla" 1 . 
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Figure 29 Fermi surface of copper, after Pippard. The 
Brillouin zone of the fee structure is the truncated octa- 
hedron derived in Chapter 2. The Fermi surface makes 
contact with the boundary at the center of the hexagonal 
faces of the zone, in the [111] directions in k space. Two 
"belly” extremal orbits are shown, denoted by B; the 
extremal "neck” orbit is denoted by N. 


Figure 30 Dog’s bone orbit of an electron on 
the Fermi surface of copper or gold in a mag- 
netic field. This orbit is classified as holelike be- 
cause the energy increases toward the interior of 
the orbit. 



Figure 31 De Haas-van Alphen effect in gold with B II [110]. The oscillation is from the dog’s 
bone orbit of Fig. 30. The signal is related to the second derivative of the magnetic moment with 
respect to field. The results were obtained by a field modulation technique in a high-homogeneity 
superconducting solenoid at about 1.2 K. (Courtesy of I. M. Templeton.) 

The free electron Fermi sphere of aluminum fills the first zone entirely 
and has a large overlap into the second and third zones, Fig. 1. The third zone 
Fermi surface is quite complicated, even though it is just made up of certain 
pieces of the surface of the free electron sphere. The free electron model also 
gives small pockets of holes in the fourth zone, but when the lattice potential is 
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Figure 32 Multiply connected hole surface of magnesium in 
bands 1 and 2. according to L. M. Falicov. (Drawing by Marta 
Puebla.) 


High magnetic field 



Weak magnetic field 


0 0 0 5 



Figure 33 Breakdown of band structure by a strong magnetic field. Brillouin zone boundaries 
are the light lines. The free electron orbits (a) in a strong field change connectivity in a weak field 
(b) to become open orbits in the first band and electron orbits in the second band. Both bands are 
mapped together. 


taken into account these empty’ out, the electrons being added to the third 
zone. The general features of the predicted Fermi surface of aluminum are 
quite well verified by experiment. Figure 32 shows part of the free electron 
Fermi surface of magnesium. 

Magnetic Breakdown. Electrons in sufficiently high magnetic fields will 
move in free particle orbits, the circular cyclotron orbits of Fig. 33a. Here the 
magnetic forces are dominant, and the lattice potential is a slight perturbation. 
In this limit the classification of the orbitals into bands may have little impor- 
tance. However, we know that at low magnetic fields the motion is described by 
(8.7) with the band structure e k that obtains in the absence of a magnetic field. 

The eventual breakdown of this description as the magnetic field is in- 
creased is called magnetic breakdown. The passage to strong magnetic fields 
may drastically change the connectivity of the orbits, as in the figure. The 
onset of magnetic breakdown will be revealed by physical properties such as 





252 


magnetoresistance that depend sensitively on the connectivity. The condition 
for magnetic breakdown is that hw c e F > Eg, approximately. Here e F is the free 
electron Fermi energy and E g is the energy gap. This condition is much milder, 
especially in metals with small gaps, than the naive condition that the mag- 
netic splitting ha> c exceed the gap. 

Small gaps may be found in hep metals where the gap across the hexagonal 
face of the zone would be zero except for a small splitting introduced by the 
spin-orbit interaction. In Mg the splitting is of the order of 10 1 eV; for this gap 
and €// ~ 10 eV the breakdown condition is fuo c > Kr 5 eV, or B > 1000 G. 


SUMMARY 


• A Fermi surface is the surface in k space of constant energy equal to € f . The 
Fermi surface separates filled states from empty states at absolute zero. 
The form of the Fermi surface is usually exhibited best in the reduced zone 
scheme, but the connectivity of the surfaces is clearest in the periodic zone 
scheme. 

• An energy band is a single branch of the e fc versus k surface. 

• The cohesion of simple metals is accounted for by the lowering of energy of 
the k = 0 conduction band orbital when the boundary conditions on the 
wavefunction are changed from Schrodinger to Wigner-Seitz. 

• The periodicity in the de Haas-van Alphen effect measures the extremal 
cross-section area S in k space of the Fermi surface, the cross section being 
taken perpendicular to B: 


(CCS) 



2 ire 
hcS 


Problems 

1. Brillouin zones of rectangular lattice. Make a plot of the first two Brillouin 
zones of a primitive rectangular two-dimensional lattice with axes a, b = 3s. 

2. Brillouin zone, rectangular lattice. A two-dimensional metal has one atom of 
valency one in a simple rectangular primitive cell a = 2 A; b = 4 A. (a) Draw the 
first Brillouin zone. Give its dimensions, in cm' 1 . (b) Calculate the radius of the 
free electron Fermi sphere, in cm -1 , (c) Draw this sphere to scale on a drawing of 
the first Brillouin zone. Make another sketch to show the first few periods of the 
free electron band in the periodic zone scheme, for both the first and second en- 
ergy bands. Assume there is a small energy gap at the zone boundary. 

3. Hexagonal close-packed structure. Consider the first Brillouin zone of a crystal 
with a simple hexagonal lattice in three dimensions w ith lattice constants a and c. 
Let G f denote the shortest reciprocal lattice vector parallel to the c axis of the 
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crystal lattice, (a) Show that for a hexagonal-close-packed crystal structure the 
Fourier component U(G C ) of the crystal potential U(r) is zero, (b) Is U(2G C ) also 
zero? (c) Why is it possible in principle to obtain an insulator made up of divalent 
atoms at the lattice points of a simple hexagonal lattice? (d) Why is it not possible 
to obtain an insulator made up of monovalent atoms in a hexagonal-close-packed 
structure? 

4. Brillouin zones of two-dimensional divalent metal. A two-dimensional metal 
in the form of a square lattice has two conduction electrons per atom. In the al- 
most free electron approximation, sketch carefully the electron and hole energy 
surfaces. For the electrons choose a zone scheme such that the Fermi surface is 
shown as closed. 

5. Open orbits. An open orbit in a monovalent tetragonal metal connects 
opposite faces of the boundary of a Brillouin zone. The faces are separated by G 
= 2 X 10 s cm 1 . A magnetic field B = 10 :! gauss = 10 -1 tesla is normal to the 
plane of the open orbit, (a) What is the order of magnitude of the period of the 
motion in k space? Take v ~ 10 s cm/sec. (b) Describe in real space the motion of 
an electron on this orbit in the presence of the magnetic field. 

6. Cohesive energy for a square well potential, (a) Find an expression for the 
binding energy of an electron in one dimension in a single square well of depth U a 
and width a. (This is the standard first problem in elementary quantum mechan- 
ics.) Assume that the solution is symmetric about the midpoint of the well, (b) 
Find a numerical result for the binding energy in terms of U 0 for the special case 
|I/ 0 | = 2 fi 2 /ma 2 and compare with the appropriate limit of Fig. 20. In this limit of 
widely separated wells the band width goes to zero, so the energy for k = 0 is the 
same as the energy for any other k in the lowest energy band. Other bands are 
formed frum the excited states of the well, in this limit. 

7. De Haas- van Alphen period of potassium, (a) Calculate the period A(l/B) ex- 
pected for potassium on tlic free electron model, (b) What is the area in real 
space of the extremal orbit, for B = 10 kG — IT? The same period applies to os- 
cillations in the electrical resistivity', known as the Shubnikow-de Haas effect. 

*8. Band edge structure on k • p perturbation theory. Consider a nondegenerate 
orbital k at k = 0 in the band n of a cubic crystal. Use second-order perturba- 
tion theory to find the result 


e„(k) = e„(0) + 


2m 


m j 


Kn0|k • p|j~0)p 
e„(0) - e/0) 


(39) 


where the sum is over all other orbitals ipjV at k = 0. The effective mass at this 
point is 


m_ = , 2 y, l(”-0|pl;0>l S 

m* ™ f e n (0) - 6,(0) ■ 


(40) 


‘This problem is somewhat difficult. 
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The mass at the conduction band edge in a narrow gap semiconductor is often 
dominated by the effect of the valence band edge, whence 

f-^SlfclrW. (41) 

where the sum is over the valence bands; E g is the energy gap. For given matrix 
elements, small gaps lead to small masses. 

9. Wannier functions. The Wannier functions of a band are defined in terms of 
the Bloch functions of the same hand by 


w(r - r„) = N 1/2 2 exp(- ik ■ rj i/» k (r) , (42) 

k 

where r n is a lattice point, (a) Prove that Wannier functions about different lattice 
points n,m are orthogonal: 

f dV w*(r — r„)u>(r — r m ) = 0 , n+m. (43) 

This orthogonality property makes the functions often of greater use than atomic 
orbitals centered on different lattice sites, because the latter are not generally or- 
thogonal. (b) The Wannier functions are peaked around the lattice sites. Show 
that for i Ji k = N ~ 112 f: ,kt u c (r) the Wannier function is 


w(x - x„) = u 0 (x) 


sin 7 t(x — x n )/a 
n{x - x u )/a 


for N atoms on a line of lattice constant a. 


10. Open orbits and magnetoresistance. We considered the transverse magneto- 
resistance of free electrons in Problem 6.9 and of electrons and holes in Problem 
8.5. In some crystals the magnetoresistance saturates except in special crystal ori- 
entations. An open orbit carries current only in a single direction in the plane 
normal to the magnetic field; such carriers are not deflected by the field. In the 
arrangement of Fig. 6.14, let the open orbits be parallel to k x ; in real space these 
orbits carry' current parallel to the y axis. Let oy,, = s<t 0 be the conductivity of the 
open orbits; this defines the constant s. The magnetoconductivity tensor in the 
high field limit w c j > 1 is 


/<?-* -p- 1 0\ 
cr 0 £ 1 s 0 , 

\ 0 0 V 


with (1 = w r T. (a) Show that the Hall field is E y = —EJsQ. (b) Show that the ef- 
fective resistivity in the x direction is p — (Q 2 /(r a )(s/s + 1), so that the resistivity 
does not saturate, but increases as B 2 . 

11. Landau levels. The vector potential of a uniform magnetic field Bz is A = 
—Byx in the Landau gauge. The hamiltonian of a free electron without spin is 


H = — (ft z /2 m)(d 2 /dy 2 + d 2 /dz 2 ) + (1/2 m)[-ihd/dx — eyB/cf . 
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We will look for an eigenfunction of the wave equation Hip = eip in the form 

ip = x(y) expLil/c.r + kjz)] . 

(a) Show that x(y) satisfies the equation 

(h 2 /2m)cl?x/dy 2 + [e — (ft 2 fc?/2m) — | rruo^iy - yof\x ~ 1' » 

where a> c = eBImc and t/ 0 ~ chkjeB. (b) Show that this is the wave equation of a 
harmonic oscillator with frequency at,., where 

e„ = {n + i)hat c + ft 2 £?/2m . 
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Figure 1 Resistance in ohms of a specimen of mercury versus absolute temperature. This plot by 
Kamerlingh Onnes marked the discovery of superconductivity'. 





CHAPTER 10: SUPERCONDUCTIVITY 


The electrical resistivity of many metals and alloys drops suddenly to zero 
when the specimen is cooled to a sufficiently low temperature, often a temper- 
ature in the liquid helium range. This phenomenon, called superconductivity, 
was observed first by Kamerlingh Onnes in Leiden in 1911, three years after 
he first liquified helium. At a critical temperature T c the specimen undergoes 
a phase transition from a state of normal electrical resistivity to a supercon- 
ducting state, Fig. 1. 

Superconductivity' is now very well understood. It is a field with many 
practical and theoretical aspects. The length of this chapter and the relevant 
appendices reflect the richness and subtleties of the field. 


EXPERIMENTAL SURVEY 

In the superconducting state the dc electrical resistivity is zero, or so close 
to zero that persistent electrical currents have been observed to flow without 
attenuation in superconducting rings for more than a year, until at last the ex- 
perimentalist wearied of the experiment. 

The decay of supercurrents in a solenoid was studied by File and Mills 
using precision nuclear magnetic resonance methods to measure the magnetic 
field associated with the supercurrent. They concluded that the decay time of 
the supercurrent is not less than 100,000 years. We estimate the decay time 
below. In some superconducting materials, particularly those used for super- 
conducting magnets, finite decay times are observed because of an irreversible 
redistribution of magnetic flux in the material. 

The magnetic properties exhibited by superconductors are as dramatic as 
their electrical properties. The magnetic properties cannot be accounted for 
by the assumption that a superconductor is a normal conductor with zero elec- 
trical resistivity. 

It is an experimental fact that a hulk superconductor in a w r eak magnetic 
field will act as a perfect diamagnet, with zero magnetic induction in the inte- 
rior. When a specimen is placed in a magnetic field and is then cooled through 
the transition temperature for superconductivity, the magnetic flux originally 
present is ejected from the specimen. This is called the Meissner effect. The 
sequence of events is shown in Fig. 2. The unique magnetic properties of su- 
perconductors are central to the characterization of the superconducting state. 

The superconducting state is an ordered state of the conduction electrons 
of the metal. The order is in the formation of loosely associated pairs of elec- 
trons. The electrons are ordered at temperatures below the transition temper- 
ature, and they are disordered above the transition temperature. 


Figure 2 Meissner effect in a superconducting sphere cooled in a constant applied magnetic field; 
on passing below the transition temperature the lines of induction B are ejected from the sphere. 


The nature and origin of the ordering was explained by Bardeen, Cooper, 
and Schrieffer. 1 In the present chapter we develop as far as we can in an ele- 
mentary way the physics of the superconducting state. We shall also discuss 
the basic physics of the materials used for superconducting magnets, but not 
their technology. Appendices H and I give deeper treatments of the super- 
conducting state. 

Occurrence of Superconductivity 

Superconductivity occurs in many metallic elements of the periodic system 
and also in alloys, intermetallic compounds, and doped semiconductors. The 
range of transition temperatures best confirmed at present extends from 90.0 K 
for the compound YBa 2 Cu 3 0 7 to below 0.001 K for the element Rh. Several 
/-band superconductors, also known as “exotic superconductors,” are listed in 
Chapter 6. Several materials become superconducting only under high pres- 
sure; for example. Si has a superconducting form at 165 kbar, with T c = 8.3 K. 
The elements known to be superconducting are displayed in Table 1, for zero 
pressure. 

Will every nonmagnetic metallic element become a superconductor at 
sufficiently low temperatures? We do not know. In experimental searches for 
superconductors with ultralow transition temperatures it is important to 
eliminate from the specimen even trace quantities of foreign paramagnetic 
elements, because they can lower the transition temperature severely. One 
part of Fe in 10 4 will destroy the superconductivity of Mo, which when pure 
has T c = 0.92 K; and 1 at. percent of gadolinium lowers the transition temper- 
ature of lanthanum from 5.6 K to 0.6 K. Nonmagnetic impurities have no very 
marked effect on the transition temperature. The transition temperatures of 
a number of interesting superconducting compounds are listed in Table 2. 
Several organic compounds show superconductivity at fairly low temperatures. 


l J. Bardeen, L. N. Cooper, and J. R. Schrieffer, Phys. Rev. 106, 162 (1957); 108, 1175 (1957). 
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Na Mg 


K Ca 


Table 1 Superconductivity parameters of the elements 

An asterisk denotes an element superconducting only in thin films or under 
high pressure in a crystal modification not normally stable. Data courtesy of 
B. T. Matthias, revised by T. Geballe. 
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Table 2 Superconductivity of selected compounds 


Compound 

T,, in K 

Compound 

T,, in K 

Nb 3 Sn 

18.05 

V ;i Ca 

16.5 

NbjGe 

23,2 

V 3 Si 

17.1 

Nb 3 Al 

17,5 

YBa 2 (,u 3 O eo 

90.0 

NbN 

16.0 

F! b,CsC 6(: 

31.3 

C-60 

19.2 

MgB 2 

39.0 


Figure 3 Experimental threshold 
curves of the critical field H C (T) 
versus temperature for several su- 
perconductors. A specimen is super- 
conducting below the curve and 
normal above the curve. 



Temperature, in K 


Destruction of Superconductivity by Magnetic Fields 

A sufficiently strong magnetic field will destroy superconductivity. The 
threshold or critical value of the applied magnetic field for the destruction of 
superconductivity is denoted by H C (T) and is a function of the temperature. At 
the critical temperature the critical field is zero: HJT C ) = 0. The variation of 
the critical field with temperature for several superconducting elements is 
shown in Fig. 3. 

The threshold curves separate the superconducting state in the lower left 
of the figure from the normal state in the upper right. Note: We should denote 
the critical value of the applied magnetic field as B ac , but this is not common 
practice among workers in superconductivity. In the CGS system we shall al- 
ways understand that H c = B ac , and in the SI we have //,. = B a Jfi 0 . The symbol 
B a denotes the applied magnetic field. 

Meissner Effect 

Meissner and Ochsenfeld (1933) found that if a superconductor is cooled 
in a magnetic field to below the transition temperature, then at the transition 
the lines of induction B are pushed out (Fig. 2). The Meissner effect shows 
that a bulk superconductor behaves as if B =0 inside the specimen. 
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We obtain a particularly useful form of this result if we limit ourselves to 
long thin specimens with long axes parallel to B a : now the demagnetizing field 
contribution (see Chapter 16) to B will be negligible, whence: 2 

(CGS) B - B a + 4vM = 0 ; or ~ = ~ ; (1) 

b a 4 TT 

(SI) B = B a + n 0 M = 0; or jf = -i = . 

The result B = 0 cannot be derived from the characterization of a super- 
conductor as a medium of zero resistivity. From Ohm’s law, E = pj, we see that 
if the resistivity p goes to zero while j is held finite, then E must be zero. By a 
Maxwell equation clB/dt is proportional to curl E, so that zero resistivity im- 
plies dB/dt = 0, but not B = 0. This argument is not entirely transparent, but 
the result predicts that the flux through the metal cannot change on cooling 
through the transition. The Meissner effect suggests that perfect diamagnet- 
ism is an essential property of the superconducting state. 

We expect another difference between a superconductor and a perfect 
conductor, defined as a conductor in which the electrons have an infinite mean 
free path. When the problem is solved in detail, it turns out that a perfect 
conductor when placed in a magnetic field cannot produce a permanent eddy 
current screen: the field will penetrate about 1 cm in an hour. 2 

The magnetization curve expected for a superconductor under the condi- 
tions of the Mcissner-Ochsenfeld experiment is sketched in Fig. 4a. This ap- 
plies quantitatively to a specimen in the form of a long solid cylinder placed in 
a longitudinal magnetic field. Pure specimens of many materials exhibit this 
behavior; they are called type I superconductors or, formerly, soft super- 
conductors. The values of H r are always too low for type I superconductors to 
have application in coils for superconducting magnets. 

Other materials exhibit a magnetization curve of the form of Fig. 4b and 
arc known as type II superconductors. They tend to be alloys (as in Fig. oa) 
or transition metals with high values of the electrical resistivity in the normal state: 
that is, the electronic mean free path in the normal state is short. We shall see later 
why the mean free path is involved in the “magnetization” of superconductors. 

Type II superconductors have superconducting electrical properties up to 
a field denoted by H r2 . Between the lower critical field H cl and the upper criti- 
cal field H a2 the flux density B # 0 and the Meissner effect is said to be incom- 
plete. The value of ff c2 may be 100 times or more higher (Fig. 5b) than 


"Diamagnetism, the magnetization M, and the magnetic susceptibility are defined in 
Chapter 14. The magnitude of the apparent diamagnetic susceptibility of bulk superconductors is 
very much larger than in typical diamagnetic substances. In (1), M is the magnetization equivalent 
to the superconducting currents in the specimen. 

3 A. B. Pippard, Dynamics of conduction electrons, Gordon and Breach, 1965. 
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(a) 



( b ) 


Figure 4 (a) Magnetization versus applied magnetic field for a bulk superconductor exhibiting a 
complete Meissner effect (perfect diamagnetism). A superconductor with this behavior is called a 
type I superconductor. Above the critical field H c the specimen is a normal conductor and the mag- 
netization is too small to be seen on this scale. Note that minus 4-TrM is plotted on the vertical scale: 
the negative value of M corresponds to diamagnetism, (b) Superconducting magnetization curve of 
a type II superconductor. The flux starts to penetrate the specimen at a field H cl lower than the 
thermodynamic critical field H c . The specimen is in a vortex state between H cl and H c2 , and it has 
superconducting electrical properties up to H c2 , Above the specimen is a normal conductor in 
every respect, except for possible surface effects. For given H, the area under the magnetization 
curve is the same for a type II superconductor as for a type I. (CCS units in all parts of this figure.) 


the value of the critical field H, calculated from the thermodynamics of the 
transition. In the region between H cl and H, 2 the superconductor is threaded 
by flux lines and is said to be in the vortex state. A field H c2 of 410 kG (41 tes- 
las) has been attained in an alloy of Nb, Al, and Ge at the boiling point of he- 
lium, and 540 kG (54 teslas) has been reported for PbMo 6 S 8 . 

Commercial solenoids wound with a hard superconductor produce high 
steady fields over 100 kG. A “hard superconductor" is a type II superconduc- 
tor with a large magnetic hysteresis, usually induced by mechanical treatment. 
Such materials have an important medical application in magnetic resonance 
imaging (MR1). 

Heat Capacity 

In all superconductors the entropy decreases markedly on cooling below 
the critical temperature T c . Measurements for aluminum are plotted in Fig. 6. 
The decrease in entropy between the normal state and the superconducting 
state tells us that the superconducting state is more ordered than the normal 
state, for the entropy is a measure of the disorder of a system. Some or all of the 
electrons thermally excited in the normal state are ordered in the supercon- 
ducting state. The change in entropy is small, in aluminum of the order of 10 -4 
k B per atom. The small entropy change must mean that only a small fraction (of 
the order of I O' 1 ) of the conduction electrons participate in the transition to 
the ordered superconducting state. The free energies of normal and supercon- 
ducting states are compared in Fig. 7. 
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Applied magnetic Held B a in gauss 


Figure 5a Superconducting magnetization curves of annealed polycrystalline lead and lead- 
indium alloys at 4.2 K. (A) lead; (B) lead-2.08 wt. percent indium; (C) lead-8.23 \vt. percent 
indium; (D) lead-20.4 wl. percent indium. (After Livingston.) 
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Temperature, K 

Figure 6 Entropy S of aluminum in the normal and superconducting states as a function of the 
temperature. The entropy is lower in the superconducting state because the electrons are more or- 
dered here than in the normal state. At any temperature below the critical temperature T c the speci- 
men can be put in the normal state by application of a magnetic field stronger than the critical field. 


The heat capacity of gallium is plotted in Fig. 8: (a) compares the normal 
and superconducting states; (b) shows that the electronic contribution to the 
heat capacity in the superconducting state is an exponential form with an argu- 
ment proportional to — 1 IT, suggestive of excitation of electrons across an en- 
ergy gap. An energy gap (Fig. 9) is a characteristic, but not universal, feature of 
the superconducting state. The gap is accounted for by the Bardeen- 
Cooper-Schrieffer (BCS) theory of superconductivity (see Appendix H). 

Energy Gap 

The energy gap of superconductors is of entirely different origin and na- 
ture than the energy gap of insulators. In an insulator the energy gap is caused 
by the electron-lattice interaction. Chapter 7. This interaction ties the electrons 
to the lattice. In a superconductor the important interaction is the electron- 
electron interaction which orders the electrons in k space with respect to the 
Fermi gas of electrons. 

The argument of the exponential factor in the electronic heat capacity of a 
superconductor is found to be ~E^2k B T and not — EJk B T. This has been 
learnt from comparison with optical and electron tunneling determinations of 
the gap Eg. Values of the gap in several superconductors are given in Table 3. 

The transition in zero magnetic field from the superconducting state to 
the normal state is observed to be a second-order phase transition. At a 
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Temperature, K 

Figure 7 Experimental values of the free energy as a function of temperature for aluminum in 
the superconducting state and in the normal state. Below the transition temperature T c = 1.180 K 
the free energy is lower in the superconducting state. The two curves merge at the transition tem- 
perature, so that the phase transition is second order (there is no latent heat of transition at T c ). 
The curve F s is measured in zero magnetic field, and j\ is measured in a magnetic field sufficient 
to put the specimen in the normal state, (Courtesy of \. E. Phillips.) 



Figure 8 (a) The heat capacity of gallium in the normal and superconducting states. The normal 
state (which is restored by a 200 G field) lias electronic, lattice, and (at low temperatures) nuclear 
quadrupole contributions. In (b) the electronic part C, , of the heat capacity in the superconduct- 
ing state is plotted on a log scale versus T c /T: the exponential dependence on 1/T is evident. Here 
y = 0.60 mj mol -1 deg' 2 . (After N. E. Phillips.) 
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Figure 9 (a) Conduction band in the normal state: (b) energy gap at the Fermi level in the super- 
conducting state. Electrons in excited states above the gap behave as normal electrons in rf fields: 
they cause resistance; at dc they are shorted out by the superconducting electrons. The gap E e is 
exaggerated in the figure: typically E R ~ 10"' 1 e,.. 


Table 3 Energy gaps in superconductors, at T = 0 

Al 

Si 

£ s (0) in 10 4 eV. 

3.4 


£ g (0 )lk B T c . 

3.3 


Sc 

Ti 

V 

Cr 

Mn 

Fe 

Co 

Ni 

Cu 

Zn 

Ga 

Ge I 



16. 

] . 






2.4 

3.3 

_ • 



3.4 







3.2 

3.5 


Y 

Zr 

Nb 

Mo 

Tc 

Ru 

Rh 

Pd 

Ag 

Cd 

In 

Sn (w) 
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30;5 

2.7 






1.5 

10.5 

11.5 



3.80 

3.4 






3.2 

3.6 

3.5 

La fee 

Hf 

Ta 

W 

Re 

Os 

Ir 

Pt 

Au 

Hg M 

TI 

Pb j 

19. 


14.. 







16.5 

7.35 

27.3 

3.7 


3.60 







4.6 

3.57 

4.38 


second-order transition there is no latent heat, but there is a discontinuity in 
the heat capacity, evident in Fig. 8a. Further, the energy gap decreases contin- 
uously to zero as the temperature is increased to the transition temperature T,., 
as in Fig. 10. A first-order transition would be characterized by a latent heat 
and by a discontinuity in the energy gap. 

Microwave and Infrared Properties 

The existence of an energy gap means that photons of energy less than the 
gap energy are not absorbed. Nearly all the photons incident are reflected as 
for any metal because of the impedance mismatch at the boundary between 
vacuum and metal, but for a very thin (~20 A) film more photons are transmit- 
ted in the superconducting state than in the normal state. 
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r/r. 


Figure 10 Reduced values of the observed 
energy gap £ g {T)/E g ( 0) as a function of the 
reduced temperature 77T„ after Townsend and 
Sutton. The solid curve is drawn for the BCS 
theory. 


For photon energies less than the energy gap, the resistivity of a supercon- 
ductor vanishes at absolute zero. At T < ^T C the resistance in the superconduct- 
ing state has a sharp threshold at the gap energy. Photons of lower energy see a 
resistanceless surface. Photons of higher energy than the energy gap see a re- 
sistance that approaches that of the normal state because such photons cause 
transitions to unoccupied normal energy levels above the gap. 

As the temperature is increased not only does the gap decrease in energy, 
but the resistivity for photons with energy below the gap energy no longer van- 
ishes, except at zero frequency. At zero frequency the superconducting elec- 
trons short-circuit any normal electrons that have been thermally excited 
above the gap. At finite frequencies the inertia of the superconducting elec- 
trons prevents them from completely screening the electric field, so that ther- 
mally excited normal electrons now can absorb energy (Problem 3). 

Isotope Effect 

It has been observed that the critical temperature of superconductors 
varies with isotopic mass. In mercury T c varies from 4.185 K to 4.146 K as the 
average atomic mass M varies from 199.5 to 203.4 atomic mass units. The tran- 
sition temperature changes smoothly when we mix different isotopes of the 
same element. The experimental results within each series of isotopes may be 
fitted by a relation of the form 

M a T c = constant . 

Observed values of a are given in Table 4. 


( 2 ) 




Table 4 Isotope effect in superconductors 

Experimental values of a in M a T c = constant, where M is the isotopic mass. 


Substance 

a 

Substance 

CL 

Zn 

0.45 ± 0.05 

Ru 

0.00 ± 0.05 

Cd 

0.32 ± 0.07 

Os 

0.15 ± 0.05 

Sn 

0.47 ± 0.02 

Mo 

0.33 

Hg 

0.50 ± 0.03 

Nb 3 Sn 

0.08 ± 0.02 

Pb 

0.49 ± 0.02 

Zr 

0.00 ± 0.05 


From the dependence of T c on the isotopic mass we learn that lattice 
vibrations and hence electron-lattice interactions are deeply involved in super- 
conductivity. This was a fundamental discovery: there is no other reason for 
the superconducting transition temperature to depend on the number of neu- 
trons in the nucleus. 

The original BCS model gave the result T c & 0 nt , bv . M -1/2 , so that a = f 
in (2), but the inclusion of coulomb interactions between the electrons 
changes the relation. Nothing is sacred about a =1. The absence of an isotope 
effect in Ru and Zr has been accounted for in terms of the electron band struc- 
ture of these metals. 


THEORETICAL SURVEY 

A theoretical understanding of the phenomena associated with supercon- 
ductivity has been reached in several ways. Certain results follow directly from 
thermodynamics. Many important results can be described by phenomenolog- 
ical equations: the London equations and the Landau-Ginzburg equations 
(Appendix 1). A successful quantum theory of superconductivity was given by 
Bardeen, Cooper, and Schrieffer, and has provided the basis for subsequent 
work. Josephson and Anderson discovered the importance of the phase of the 
superconducting wavefunction. 

The r mody namics of the Superconducting Transition 

The transition between the normal and superconducting states is thermo- 
dynamically reversible, just as the transition between liquid and vapor phases 
of a substance is reversible. Thus we may apply thermodynamics to the transi- 
tion, and we thereby obtain an expression for the entropy difference between 
normal and superconducting states in terms of the critical field curve H r ver- 
sus T. This is analogous to the vapor pressure equation for the liquid-gas 
coexistence curve (TP, Chapter 10). 
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We treat a type I superconductor with a complete Meissner effect, so that 
B = 0 inside the superconductor. We shall see that the critical field H c is a quan- 
titative measure of the free energy difference between the superconducting and 
normal states at constant temperature. The symbol H c will always refer to a bulk 
specimen, never to a thin film. For type II superconductors, H c is understood to 
be the thermodynamic critical field related to the stabilization free energy. 

The stabilization free energy of the superconducting state with respect to 
the normal state can be determined by calorimetric or magnetic measure- 
ments. In the calorimetric method the heat capacity is measured as a function 
of temperature for the superconductor and for the normal conductor, which 
means the superconductor in a magnetic field larger than H c . From the differ- 
ence of the heat capacities we can compute the free energy difference, which 
is the stabilization free energy of the superconducting state. 

In the magnetic method the stabilization free energy is found from the 
value of the applied magnetic field that will destroy the superconducting 
state, at constant temperature. The argument follows. Consider the work done 
(Fig. 11) on a superconductor when it is brought reversibly at constant tem- 
perature from a position at infinity (where the applied field is zero) to a posi- 
tion r in the field of a permanent magnet: 

W = — f B M ■ dB a , (3) 

J o 




Figure 11 (a) A superconductor in which the Meissner effect is complete has B = 0, as if the 
magnetization were M = — in CGS units, (b) When the applied field reaches the value B„ c , 

the normal state can coexist in equilibrium with the superconducting state. In coexistence the free 
energy densities are equal: F N (T, B ac ) = F S (T, B ac ). 
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per unit volume of specimen. This work appears in the energy of the magnetic 
field. The thermodynamic identity for the process is 

dF =- M • dB a , (4) 

as in TP, Chapter 8. 

For a superconductor with M related to B„ by (1) we have 


(CGS) 

dF s = ±B a dB a ; 

(5) 

(SI) 

^F s = ^ B a dB a . 


The 

increase in the free energy density of the superconductor is 


(CGS) 

F s (B a ) - F s ( 0) = Bh 8tt ; 

(6) 

(SI) 

F s (B a ) - F,( 0) - B\! 2/q, , 



on being brought from a position where the applied field is zero to a position 
where the applied field is B a . 

Now consider a normal nonmagnetic metal. If we neglect the small 
susceptibility 4 of a metal in the normal state, then M = 0 and the energy of the 
normal metal is independent of field. At the critical field we have 

Fn(BJ = F„(0) . (7) 

The results (6) and (7) are all we need to determine the stabilization 
energy of the superconducting state at absolute zero. At the critical value B ac 
of the applied magnetic field the energies are equal in the normal and super- 
conducting states: 

(CCS) F n (BJ = F S (BJ = F s ( 0) + B % J 877 , (8) 

(SI) F N (BJ = F S (BJ = F s (0) + B%/2,i 0 , 

In SI units H c = B ac //x 0 , whereas in CGS units H c = B ac . 

The specimen is stable in either state when the applied field is equal to 
the critical field. Now by (7) it follows that 

(CCS) AF - F n ( 0) - F s ( 0) = Bi/877 , (9) 

4 This is an adequate assumption for type I superconductors. In type II superconductors in 
high fields the change in spin paramagnetism of the conduction electrons lowers the energy of the 
normal phase significantly. In some, but not all, type II superconductors the upper critical field is 
limited by this effect. Clogston has suggested that H ci ( max) = 18,400 T c , where H c 2 is in gauss and 
T c inK. 
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Applied magnetic field B a — >- 


Figure 12 The free energy density F v of a nonmag- 
netic normal metal is approximately independent of the 
intensity of the applied magnetic field B„. At a temper- 
ature T <T C die metal is a superconductor in zero mag- 
netic field, so that F S (T, 0) is lower than F N {T, 0). An 
applied magnetic field increases F, by ffy8ir, in CGS 
units, so that F S (T, B a ) = t\[T, 0) + F> % J%tt. If B u is larger 
than the critical field R„ r the free energy density is 
lower in the normal state than in the superconducting 
state, and now the normal state is the stable state. The 
origin of the vertical scale in the drawing is at F S (T, 0). 
The figure equally applies to U s and t/, v at T = 0. 


where AF is the stabilization free energy density of the superconducting state. 
For aluminum, B an at absolute zero is 105 gauss, so that at absolute zero 
AF = (105) 2 /8vr = 439 erg cm 3 , in excellent agreement with the result of 
thermal measurements, 430 erg cm -3 . 

At a finite temperature the normal and superconducting phases 
are in equilibrium when the magnetic field is such that their free energies 
F = U — TS are equal. The free energies of the two phases are sketched 
in Fig. 12 as a function of the magnetic field. Experimental curves of the 
free energies of the two phases for aluminum are shown in Fig. 7. Because 
the slopes dF/dT are equal at the transition temperature, there is no latent 
heat at T c . 

London Equation 

We saw that the Meissner effect implies a magnetic susceptibility X — — VAir 
in CGS in the superconducting state or, in SI, X = — 1. Can we modify a consti- 
tutive equation of electrodynamics (such as Ohm’s law) in some way to obtain 
the Meissner effect? We do not want to modify the Maxwell equations them- 
selves. Electrical conduction in the normal state of a metal is described by 
Ohms law j = <tE. We need to modify this drastically to describe conduction 
and the Meissner effect in the superconducting state. Let us make a postulate 
and see what happens. 

We postulate that in the superconducting state the current density is di- 
rectly proportional to the vector potential A of the local magnetic field, where 
B — curl A. The gauge of A will be specified. In CGS units we write the 
constant of proportionality as — cM-jrAf, for reasons that will become clear. 
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Here c is the speed of light and \ L is a constant with the dimensions of length. 
In SI units we write — l/p, (t Aj . Thus 

(CGS) (SI) j = ~^ A - < 10 ) 

4 7™,. M(Al V :|. 

This is the London equation. We express it another way by taking the curl of 
both sides to obtain 

(CGS) curl j = ; (SI) curl j = ^B. (11) 

4ttA| 

The London equation (10) is understood to be written with the vector po- 
tential in the London gauge in which div A = 0, and A„ = 0 on any external 

surface through which no external current is fed. The subscript n denotes the 

component normal to the surface. Thus div j = 0 and j„ = 0, the actual physi- 
cal boundary conditions. The form (10) applies to a simply connected super- 
conductor; additional terms may be present in a ring or cylinder, but (11) 
holds true independent of geometry. 

First we show that the London equation leads to the Meissner effect. By a 
Maxwell equation we know that 

(CGS) curlB=-^j ; (SI) curl B = p. 0 j ; (12) 

under static conditions. We take the curl of both sides to obtain 
(CGS) curl curl B = — V 2 B = ~ curl j ; 

(SI) curl curl B = - V 2 B = /x 0 curl j ; 


which may be combined with the London equation (11) to give for a super- 
conductor 

V 2 B = B/Af . (13) 

This equation is seen to account for the Meissner effect because it does 
not allow a solution uniform in space, so that a uniform magnetic field cannot 
exist in a superconductor. That is, B(r) = B 0 = constant is not a solution of 
(13) unless the constant field B 0 is identically zero. The result follows because 
V 2 B 0 is always zero, but B 0 /A| is not zero unless B 0 is zero. Note further that 
(12) ensures that j = 0 in a region where B = 0. 

In the pure superconducting state the only field allowed is exponentially 
damped as we go in from an external surface. Let a semi-infinite superconductor 
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Figure 13 Penetration of an applied magnetic 
® field into a semi-infinite superconductor. The 

f 1 t penetration depth A is defined as the distance in 

which the field decreases by the factor e . Typi- 
cally, A = 500 A in a pure superconductor. 


occupy the space on the positive side of the x axis, as in Fig. 13. If B(0) is the 
field at the plane boundary, then the field inside is 

B(x) = B(0)exp(-x/A L ) , (14) 

for this is a solution of (13). In this example the magnetic field is assumed to 
be parallel to the boundary. Thus we see A L measures the depth of penetration 
of the magnetic field; it is known as the London penetration depth. Actual 
penetration depths are not described precisely by A L alone, for the London 
equation is now known to be somewhat oversimplified. It is shown by compari- 
son of (22) with (11) that 

(CGS) A l = (mc 2 /4Tmq 2 ) V2 ; (SI) A L = (e t) mc 2 /nq 2 ) V2 (14a) 

for particles of charge q and mass m in concentration n. Values are given in 
Table 5. 

An applied magnetic field B„ will penetrate a thin film fairly uniformly if 
the thickness is much less than A L ; thus in a thin film the Meissner effect is not 
complete. In a thin film the induced field is much less than B a , and there is 
little effect of B a on the energy density of the superconducting state, so that 
(6) does not apply. It follows that the critical field H c of thin films in parallel 
magnetic fields will be very high. 


Table 5 Calculated intrinsic coherence length and 
London penetration depth, at absolute zero 


Metal 

Intrinsic Pippard 
coherence 
length £ 0 , 
in 10 6 cm 

London 
penetration 
depth A l , 
in 10 -6 cm 


Sn 

23. 

3.4 

0.16 

Al 

160. 

1.6 

0.010 

Pb 

8.3 

3.7 

0.45 

Cd 

76. 

11.0 

0.14 

Nb 

3.8 

3.9 

1.02 


After R. Meservey and B. B. Schwartz. 
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Coherence Length 

The London penetration depth A L is a fundamental length that character- 
izes a superconductor. An independent length is the coherence length £. The 
coherence length is a measure of the distance within which the superconduct- 
ing electron concentration cannot change drastically in a spatially-varying 
magnetic field. 

The London equation is a local equation: it relates the current density at a 
point r to the vector potential at the same point. So long as j(r) is given as a 
constant time A(r), the current is required to follow exactly any variation in 
the vector potential. But the coherence length £ is a measure of the range over 
which we should average A to obtain j. It is also a measure of the minimum spa- 
tial extent of a transition layer between normal and superconductor. The coher- 
ence length is best introduced into the theory through the Landau-Ginzburg 
equations, Appendix I. Now we give a plausibility argument for the energy re- 
quired to modulate the superconducting electron concentration. 

Any spatial variation in the state of an electronic system requires extra 
kinetic energy. A modulation of an eigenfunction increases the kinetic energy 
because the modulation will increase the integral of d 2 tp/dx 2 . It is reasonable to 
restrict the spatial variation of j(r) in such a way that the extra energy is less 
than the stabilization energy of the superconducting state. 

We compare the plane wave i jj(x) = e' ix with the strongly modulated 
wavefu notion: 

<p{x) = T m + e fc ) . (15a) 

The probability density associated with the plane w'ave is uniform in space: 

= e !kx e ikx = 1 , whereas cp*<p is modulated with the wavevector q: 

if* ip = l(e~ i(k+t/)l + c~ ihx )(e iik+ ‘^ + e ifa ) 

= 1(2 + + e -W) = 1 + cos qx . (15b) 

The kinetic energy of the wave i fi(x) is e = h 2 k 2 /2m-, the kinetic energy of 
the modulated density distribution is higher, for 



[(fc + < 7 )2 +fc 2] ss £U 2 +£fy , 


where we neglect q 2 for q k. 

The increase ot energy required to modulate is h 2 kq/2m. If this increase 
exceeds the energy gap E„, superconductivity will be destroyed. The critical 
value q 0 of the modulation wavevector is given by 

h 2 7 _ „ 

2 m Mo-£ g • 


(16a) 
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We define an intrinsic coherence length £ 0 related to the critical modu- 
lation by £ 0 = 1 /q 0 . We have 

fo = h%!2mE s = fiu t /2E g , ( 16b) 

where v F is the electron velocity at the Fermi surface. On the BCS theory a 
similar result is found: 


£o - 2hv F /rrE $ . 


(17) 


Calculated values of £o from (17) are given in Table 5. The intrinsic coherence 
length £ 0 is characteristic of a pure superconductor. 

In impure materials and in alloys the coherence length | is shorter than £ 0 . 
This may be understood qualitatively: in impure material the electron eigen- 
functions already have wiggles in them: wc can construct a given localized 
variation of current density' with less energy from wavefunctions with wiggles 
than from smooth wavefunctions. 

The coherence length first appeared in the Landau-Ginzburg equations; 
these equations also follow from the BCS theory'. They describe the structure 
of the transition lay'er between normal and superconducting phases in contact. 
The coherence length and the actual penetration depth A depend on the mean 
free path € of the electrons measured in the normal state; the relationships are 
indicated in Fig. 14. When the superconductor is very' impure, with a very 
small €, then £ ~ (£,-/) 1/2 and A « A L (£o/f) 1/2 , so that A/£ = A ,/t . This is the 
“dirty superconductor” limit. The ratio A/£ is denoted by k. 


BCS Theory of Superconductivity 

The basis of a quantum theory' of superconductivity was laid by the classic 
1957 papers of Bardeen, Cooper, and Schrieffer. There is a “BCS theory of 
superconductivity” with a very wide range of applicability', from He 3 atoms in 
their condensed phase, to type I and type II metallic superconductors, and to 
high-temperature superconductors based on planes of cuprate ions. Further, 



So 


Figure 14 Penetration depth A and the coherence 
length £ as functions of the mean free path £ of the 
conduction electrons in the normal state. All 
lengths are in units of £>, the intrinsic coherence 
length. The curves are sketched for £ 0 = 10A L . For 
short mean free paths the coherence length be- 
comes shorter and the penetration depth becomes 
longer. The increase in the ratio kA/£ favors type II 
superconductivity. 
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tli ere is a “BCS wavefunction” composed of particle pairs kt and "kf, which, 
when treated by the BCS theory, gives the familiar electronic superconductiv- 
ity observed in metals and exhibits the energy gaps of Table 3. This pairing is 
known as s-wave pairing. There are other forms of particle pairing possible 
with the BCS theory, but we do not have to consider other than the BCS wave- 
function here. In this chapter we treat the specific accomplishments of BCS 
theory with a BCS wavefunction, which include: 

1. An attractive interaction between electrons can lead to a ground state 
separated from excited states by an energy gap. The critical field, the thermal 
properties, and most of the electromagnetic properties are consequences of 
the energy gap. 

2. The elcctron-lattice-electron interaction leads to an energy gap of the 
observed magnitude. The indirect interaction proceeds when one electron in- 
teracts with the lattice and deforms it; a second electron sees the deformed 
lattice and adjusts itself to take advantage of the deformation to lower its en- 
ergy. Thus the second electron interacts with the first electron via the lattice 
deformation. 

3 . The penetration depth and the coherence length emerge as natural 
consequences of the BCS theory. The London equation is obtained for mag- 
netic fields that vary slowly in space. Thus the central phenomenon in super- 
conductivity, the Meissner effect, is obtained in a natural way. 

4 . The criterion for the transition temperature of an element or alloy in- 
volves the electron density' of orbitals D(e F ) of one spin at the Fermi level and 
the electron-lattice interaction 17, which can be estimated from the electrical 
resistivity because the resistivity at room temperature is a measure of the 
electron-phonon interaction. For UD(e F ) 1 the BCS theory predicts 

T c = 1.146 exp[— I/C7D(e F )] , (18) 

where 8 is the Debye temperature and U is an attractive interaction. The re- 
sult for T c is satisfied at least qualitatively by the experimental data. There is 
an interesting apparent paradox: the higher the resistivity at room temperature 
the higher is U, and thus the more likely it is that the metal will be a super- 
conductor when cooled. 

5. Magnetic flux through a superconducting ring is quantized and the ef- 
fective unit of charge is 2e rather than e. The BCS ground state involves pairs 
of electrons; thus flux quantization in terms of the pair charge 2c is a conse- 
quence of the theory'. 

BCS Ground State 

The filled Fermi sea is the ground state of a Fermi gas of noninteract- 
ing electrons. This state allows arbitrarily small excitations — we can form an 
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(a) 


(b) 


Figure 15 (a) Probability P that an or- 
bital of kinetic energy e is occupied in the 
ground state of the noninteracting Fermi 
gas; (b) the BCS ground state differs 
from the Fermi state in a region of width 
of the order of the energy gap E g . Both 
curves are for absolute zero. 


excited state by taking an electron from the Fermi surface and raising it just 
above the Fermi surface. The BCS theory shows that with an appropriate at- 
tractive interaction between electrons the new ground state is superconduct- 
ing and is separated by a finite energy E g from its lowest excited state. 

The formation of the BCS ground state is suggested by Fig. 15. The BCS 
state in (b) contains admixtures of one-electron orbitals from above the Fermi 
energy e F . At first sight the BCS state appears to have a higher energy than the 
Fermi state: the comparison of (b) with (a) shows that the kinetic energy of the 
BCS state is higher than that of the Fermi state. But the attractive potential 
energy of the BCS state, although not represented in the figure, acts to lower 
the total energy of the BCS state with respect to the Fermi state. 

When the BCS ground state of a many-electron system is described in 
terms of the occupancy of one-particle orbitals, those near e F are filled some- 
what like a Fermi-Dirac distribution for some finite temperature. 

The central feature of the BCS state is that the one-particle orbitals are 
occupied in pairs: if an orbital with wavevector k and spin up is occupied, then 
the orbital with wavevector — k and spin down is also occupied. If kT is vacant, 
then — ksl is also vacant. The pairs are called Cooper pairs, treated in 
Appendix H. They have spin zero and have many attributes of bosons. 

Flux Quantization in a Superconducting Ring 

We prove that the total magnetic flux that passes through a superconduct- 
ing ring may assume only quantized values, integral multiples of the flux quan- 
tum 27 rhc/q, where by experiment q = 2e, the charge of an electron pair. Flux 
quantization is a beautiful example of a long-range quantum effect in which 
the coherence of the superconducting state extends over a ring or solenoid. 

Let us first consider the electromagnetic field as an example of a similar 
boson field. The electric field intensity E(r) acts qualitatively as a probability 
field amplitude. When the total number of photons is large, the energy density 
may be written as 


E* (r)E(r)/47T = n(r)hw , 



where n(r) is the numher density of photons of frequency o>. Then we may 
write the electric field in a semiclassical approximation as 


E(r) s (4fflM 1/2 n( r) 1/2 e w(r) £*( r) = (4rr*w) 1/2 n( r) m e^ 0[l] , 


where 0(r) is the phase of the field. A similar probability amplitude describes 
Cooper pairs. 

The arguments that follow apply to a boson gas with a large number of 
bosons in the same orbital. We then can treat the boson probability amplitude 
as a classical quantity, just as the electromagnetic field is used for photons. Both 
amplitude and phase are then meaningful and observable. The arguments do 
not apply to a metal in the normal state because an electron in the normal state 
acts as a single unpaired fermion that cannot be treated classically. 

We first show that a charged boson gas obeys the London equation. 
Let ip(r) be the particle probability amplitude. We suppose that the pair 
concentration n — = constant. At absolute zero n is one-half of the con- 

centration of electrons in the conduction band, for n refers to pairs. Then we 
may write 

i \t = n m e im ; (]/* = n m e ieir> . ( 19 ) 


The phase 0(r) is important for what follows. In SI units, set c = 1 in the equa- 
tions that follow. 

The velocity of a particle is, from the Hamilton equations of mechanics, 


(CGS) 



The particle flux is given by 


ip’vtj/ — 




so that the electric current density is 


j = 



( 20 ) 


( 21 ) 


We may take the curl of both sides to obtain the London equation: 

■v2 


curl j = 


nq 


me 


B 


( 22 ) 


with use of the fact that the curl of the gradient of a scalar is identically zero. 
The constant that multiplies B agrees with (14a). We recall that the Meissner 
effect is a consequence of the London equation, which w'e have here 
derived. 

Quantization of the magnetic flux through a ring is a dramatic conse- 
quence of Eq. (21). Let us take a closed path C through the interior of the 
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Flux lines 



Figure 16 Path of integration C through the interior of a 
superconducting ring. The flux through the ring is the sum 
of the flux $ 5 ,, from external sources and the flux 4> sc from 
the superconducting currents which flow in the surface of 
the ring; $ — <l> est + <h se . The flux <t> is quantized. There is 
normally no quantization condition on the flux from exter- 
nal sources, so that <t> sc must adjust itself appropriately in 
order that assume a quantized value. 


superconducting material, well away from the surface (Fig. 16). The Meissner 
effect tells us that B and j are zero in the interior. Now (21) is zero if 


hcVO = qA . 

(23) 

We form 


<j> ve-dl = e 2 -e 1 


for the change of phase on going once around the ring. 

The probability amplitude ip is measurable in the classical approximation, 
so that ip must be single-valued and 

0 2 ~ 6\ — 27 TS , 

(24) 

where s is an integer. By the Stokes theorem, 


<j> A • dl = J (curl A) • da = J B 

• da = , (25) 


where dor is an element of area on a surface bounded by the curve C, and <1> is 
the magnetic flux through C. From (23), (24), and (25) we have 2 irhcs = q$>, or 

<I> = ( 2irhc/q)s . (26) 


Thus the flux through the ring is quantized in integral multiples of 2-7T ftc/q. 

By experiment q = — 2e as appropriate for electron pairs, so that the quan- 
tum of flux in a superconductor is 

(CGS) <b 0 = 2Trhc/2e = 2.0678 X 10“ ' gauss cm 2 ; 

(SI) <&o = 2irh/2e = 2.0678 X 10“ 15 tesla m 2 . ( 27 ) 

This flux quantum is called a fluxoid or fluxon. 

The flux through the ring is the sum of the flux 3> ext from external sources 
and the flux <l> sc from the persistent superconducting currents which flow in 
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the surface of the ring: $ = + <3> sc . The flux $ is quantized. There is nor- 

mally no quantization condition on the flux from external sources, so that 
must adjust itself appropriately in order that <f> assume a quantized value. 

Duration of Persistent Currents 

Consider a persistent current that flows in a ring of a type I superconduc- 
tor of wire of length L and cross-sectional area A. The persistent current main- 
tains a flux through the ring of some integral number of lluxoids (27). A fluxoid 
cannot leak out of the ring and thereby reduce the persistent current unless by 
a thermal fluctuation a minimum volume of the superconducting ring is mo- 
mentarily in the normal state. 

The probability per unit time that a fluxoid will leak out is the product 

P = (attempt frequency)(activation barrier factor) . (28) 

The activation barrier factor is exp(— A F/k B T), where the free energy of the 
barrier is 

AF ~ (minimum volume)(excess free energy density of normal state) . 

The minimum volume of the ring that must turn normal to allow a fluxoid to 
escape is of the order of Ky 2 , where 6 is the coherence length of the super- 
conductor and R the wire thickness. The excess free energy' density of the nor- 
mal state is whence the barrier free energy is 

AF - HP 2 if/8TT . (29) 

Let the wire thickness be 10 4 cm, the coherence length — 10 -4 cm, and 
H c — 10 3 G; then AF =“ 10“ ' erg. As we approach the transition temperature 
from below, A F will decrease toward zero, but the value given is a fair estimate 
between absolute zero and 0.8 T,.. Thus the activation barrier factor is 

exp(— AF/k B T) ~ exp(-10 B ) « l(T< 434xlo7) . 

The characteristic frequency with which the minimum volume can attempt 
to change its state must be of the order of E^/h. If E g = 10 -15 erg, the attempt 
frequency is —10 15 /10 27 — 10 12 s _1 . The leakage probability (28) becomes 

P = 10 12 10-“ 34X107 s- 1 = io- 434xl ° , s ^ 1 . 

The reciprocal of this is a measure of the time required for a fluxoid to leak 
out, T = 1/P = 10 434xl ° 7 s. 

The age of the universe is only 10 1,s s, so that a fluxoid will not leak out in 
the age of the universe, under our assumed conditions. Accordingly, the cur- 
rent is maintained. 

There are two circumstances in which the activation energy is much lower 
and a fluxoid can be observed to leak out of a ring — either very close to the 
critical temperature, where H c is very small, or when the material of the ring is 
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a type II superconductor and already has fluxoids embedded in it. These spe- 
cial situations are discussed in the literature under the subject of fluctuations 
in superconductors. 

Type II Superconductors 

There is no difference in the mechanism of superconductivity in type I 
and type II superconductors. Both types have similar thermal properties at the 
superconductor-normal transition in zero magnetic field. But the Meissner 
effect is entirely different (Fig. 5). 

A good type I superconductor excludes a magnetic field until super- 
conductivity is destroyed suddenly, and then the field penetrates completely. A 
good type II superconductor excludes the field completely up to a field H cl . 
Above H cl the field is partially excluded, but the specimen remains electrically 
superconducting. At a much higher field, H c2 , the flux penetrates completely 
and superconductivity vanishes. (An outer surface layer of the specimen may 
remain superconducting up to a still higher field f/ cJ .) 

An important difference in a type I and a type II superconductor is in the 
mean free path of the conduction electrons in the normal state. If the coher- 
ence length £ is longer than the penetration depth A, the superconductor will 
be type I. Most pure metals are type I, with A/f < 1 (see Table 5 on p. 275). 

But, when the mean free path is short, the coherence length is short and 
the penetration depth is great (Fig. 14). This is the situation when A/£ > 1, 
and the superconductor will be type II. 

We can change some metals from type I to type II by a modest addition 
of an alloying element. In Figure 5 the addition of 2 w 7 t. percent of indium 
changes lead from type I to type II, although the transition temperature is 
scarcely changed at all. Nothing fundamental has been done to the electronic 
structure of lead by this amount of alloying, but the magnetic behavior as a 
superconductor has changed drastically. 

The theory of type II superconductors was developed by Ginzburg, 
Landau, Abrikosov, and Gorkov. Later Kunzler and co-workers observed that 
NbjSn wares can carry large supercurrcnts in fields approaching 100 kC; this 
led to the commercial development of strong- field superconducting magnets. 

Consider the interface between a region in the superconducting state and 
a region in the normal state. The interface has a surface energy that may be 
positive or negative and that decreases as the applied magnetic field is in- 
creased. A superconductor is type I if the surface energy is alw'ays positive as 
the magnetic field is increased, and type II if the surface energy' becomes 
negative as the magnetic field is increased. The sign of the surface energy has 
no importance for the transition temperature. 

The free energy of a bulk superconductor is increased when the magnetic 
field is expelled. However, a parallel field can penetrate a very thin film nearly 
uniformly (Fig. 17), only a part of the flux is expelled, and the energy of the 
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(a) (b) 

Figure 17 (a) Magnetic field penetration into a thin film of thickness equal to the penetration 
depth A. The arrows indicate the intensity of the magnetic field, (b) Magnetic field penetration in 
a homogeneous bulk structure in the mixed or vortex state, with alternate layers in normal and su- 
perconducting states. The superconducting layers are thin in comparison with A. The laminar 
structure is shown for convenience; the actual structure consists of rods of the normal state sur- 
rounded by the superconducting state. (The N regions in the vortex state are not exactly normal, 
but are described by low values of the stabilization energy density.) 


superconducting film will increase only slowly as the external magnetic field is 
increased. This causes a large increase in the field intensity required for the 
destruction of superconductivity. The film has the usual energy gap and will be 
resistanceless. A thin film is not a type II superconductor, but the film results 
show that under suitable conditions superconductivity can exist in high mag- 
netic fields. 

Vortex State. The results for thin films suggest the question: Are there sta- 
ble configurations of a superconductor in a magnetic field with regions (in the 
form of thin rods or plates) in the normal state, each normal region sur- 
rounded by a superconducting region? In such a mixed state, called the vortex 
state, the external magnetic field will penetrate the thin normal regions uni- 
formly, and the field will also penetrate somewhat into the surrounding super- 
conducting material, as in Fig. 18. 

The term vortex state describes the circulation of superconducting 
currents in vortices throughout the bulk specimen, as in Fig. 19. There is no 
chemical or crystallographic difference between the normal and the supercon- 
ducting regions in the vortex state. The vortex state is stable when the penetra- 
tion of the applied field into the superconducting material causes the surface 
energy to become negative. A type II superconductor is characterized by a 
vortex state stable over a certain range of magnetic field strength; namely, 
between H cl and H c2 . 

Estimation of H cl and H c2 . What is the condition for the onset of the 
vortex state as the applied magnetic field is increased? We estimate H cl from 
the penetration depth A . The field in the normal core of the fluxoid will be H cl 
when the applied field is H cl . 
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Figure 18 Variation of the magnetic field and en- 
ergy gap parameter A(x) at the interface of super- 
conducting and normal regions, for type I and 
type II superconductors. The energy gap parameter 
is a measure of the stabilization energy density of 
the superconducting state. 


The field will extend out from the normal core a distance A into the super- 
conducting environment. The flux thus associated with a single core is 7rA 2 H cl , 
and this must be equal to the flux quantum 3> 0 defined by (27). Thus 

H c i — 3>o/7tA 2 . (30) 

This is the field for nucleation of a single fluxoid. 

At H c2 the fluxoids are packed together as tightly as possible, consistent 
with the preservation of the superconducting state. This means as densely as 
the coherence length £ will allow. The external field penetrates the specimen 
almost uniformly, with small ripples on the scale of the fluxoid lattice. Each 
core is responsible for carrying a flux of the order of 7 r£ 2 H c2 , which also is 
quantized to <t> 0 . Thus 

*«/«£* (31) 

gives the upper critical field. The larger the ratio A/£, the larger is the ratio of 
H c 2 to H cl . 



Figure 19 Flux lattice in NbSe, at 1,000 gauss at 0.2K, as viewed with a scanning tunneling 
microscope. The photo shows the density of states at the Fermi level, as in Figure 23. The vortex 
cores have a high density of states and are shaded white; the superconducting regions are dark, 
with no states at the Fermi level. The amplitude and spatial extent of these states is determined by 
a potential well formed by A(x) as in Fig. 18 for a Type II superconductor. The potential well 
confines the core state wavefunctions in the image here. The star shape is a finer feature, a result 
special to NbSe 2 of the sixfold disturbance of the charge density at the Fermi surface. Photo cour- 
tesy of H. F. Hess. 


It remains to find a relation between these critical fields and the thermo- 
dynamic critical field H c that measures the stabilization energy density of the 
superconducting state, which is known by (9) to be H?./8v. In a type II super- 
conductor we can determine H c only indirectly by calorimetric measurement 
of the stabilization energy. To estimate H cl in terms of H c , we consider the 
stability of the vortex state at absolute zero in the impure limit £ < A; here 
k > 1 and the coherence length is short in comparison with the penetration 
depth. 

We estimate in the vortex state the stabilization energy of a fluxoid core 
viewed as a normal metal cylinder which carries an average magnetic field B a . 
The radius is of the order of the coherence length, the thickness of the bound- 
ary between N and S phases. The energy of the normal core referred to the 
energy of a pure superconductor is given by the product of the stabilization 
energy times the area of the core: 

/core 5=5 J^ H c X 77^ 2 , 


(CGS) 


(32) 
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per unit length. But there is also a decrease in magnetic energy because of the 
penetration of the applied field B a into the superconducting material around 


the core: 

(CGS) /mag *“ ~ Bl X Tr\ 2 . (33) 

For a single fluxoid we add these two contributions to obtain 
(CGS) /=/core + /mag * I (#c¥ ~ . (34) 

The core is stable if/ < 0. The threshold field for a stable fluxoid is at/ = 0, 
or, with H cl written for B a , 

H c yfH c ~&. A . (35) 

The threshold field divides the region of positive surface energy from the re- 
gion of negative surface energy. 

We can combine (30) and (35) to obtain a relation for H c : 

77£AH c = <D 0 . (36) 

We can combine (30), (31), and (35) to obtain 

(H c1 H c2 ) v2 ~H c , (37a) 

and 

H c2 ~ (A/f)H, = xH c . (37b) 

Single Particle Tunneling 

Consider two metals separated by an insulator, as in Fig. 20. The insulator 


normally acts as a barrier to the flow of conduction electrons from one metal 
to the other. If the barrier is sufficiently thin (less than 10 or 20 A) there is a 
significant probability that an electron which impinges on the barrier will pass 
from one metal to the other: this is called tunneling. In many experiments the 
insulating layer is simply a thin oxide layer formed on one of two evaporated 
metal films, as in Fig. 21. 

When both metals are normal conductors, the current-voltage relation of 
the sandwich or tunneling junction is ohmic at low voltages, with the current 
directly proportional to the applied voltage. Giaever (1960) discovered that if 
one of the metals becomes superconducting the current-voltage characteristic 
changes from the straight line of Fig. 22a to the curve shown in Fig. 22b. 


Figure 20 Two metals, A and B, separated by a thin layer of an 
insulator C. 
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Figure 21 Preparation of an Al/Al 2 0 3 /$n sandwich, (a) Glass slide with indium contacts, (b) An 
aluminum strip 1 mm wide and 1000 to 3000 A thick has been deposited across the contacts, 
(c) The aluminum strip has been oxidized to form an Al 2 0 3 layer 10 to 20 A in thickness, (d) A tin 
film has been deposited across the aluminum film, forming an Al/AhCb/Sn sandwich. The external 
leads are connected to the indium contacts; two contacts are used for the current measurement 
and two for the voltage measurement. (After Giaever and Megerle.) 


Figure 22 (a) Linear current-voltage 

relation for junction of normal metals 
separated by oxide layer; (b) current- 
voltage relation with one metal normal 
and the other metal superconducting. 





Figure 23 The density of orbitals and the current-voltage characteristic for a tunneling junction. 
In (a) the energy is plotted on the vertical scale and the density of orbitals on the horizontal scale. 
One metal is in the normal state and one in the superconducting state, (b) I versus V; the dashes 
indicate the expected break at T = 0. (After Giaever and Megerle.) 


Figure 23a contrasts the electron density of orbitals in the superconductor 
with that in the normal metal. In the superconductor there is an energy gap 
centered at the Fermi level. At absolute zero no current can flow until the 
applied voltage is V = E g /2e = A/e. 

The gap E„ corresponds to the break-up of a pair of electrons in the 
superconducting state, with the formation of two electrons, or an electron and 
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a hole, in the normal state. The current starts when eV — A. At finite 
temperatures there is a small current flow even at low voltages, because 
of electrons in the superconductor that are thermally excited across the 
energy gap. 

Josephson Superconductor Tunneling 

Under suitable conditions we observe remarkable effects associated with 
the tunneling of superconducting electron pairs from a superconductor 
through a layer of an insulator into another superconductor. Such a junction is 
called a weak link. The effects of pair tunneling include: 

Dc Josephson effect. A dc current flows across the junction in the ab- 
sence of any electric or magnetic field. 

Ac Josephson effect. A dc voltage applied across the junction causes 
rf current oscillations across the junction. This effect has been utilized in a 
precision determination of the value of h/e. Further, an rf voltage applied with 
the dc voltage can then cause a dc current across the junction. 

Macroscopic long-range quantum interference. A dc magnetic field 
applied through a superconducting circuit containing two junctions causes the 
maximum supercurrcnt to show interference effects as a function of magnetic 
field intensity. This effect can be utilized in sensitive magnetometers. 

Dc Josephson Effect. Our discussion of Josephson junction phenomena 
follows the discussion of flux quantization. Let t /q be the probability amplitude 
of electron pairs on one side of a junction, and let ij/ 2 be the amplitude on the 
other side. For simplicity, let both superconductors be identical. For the pres- 
ent we suppose that they are both at zero potential. 

The time-dependent Schrodinger equation iHdi/f/dt = c Mijf applied to the 
two amplitudes gives 


ih = fiTtjiz ; ih = hT t/q . (38) 

Here hT represents the effect of the electron-pair coupling or transfer interac- 
tion across the insulator; T has the dimensions of a rate or frequency. It is a 
measure of the leakage of i/q into the region 2, and of t/q into the region 1. If 
the insulator is very thick, T is zero and there is no pair tunneling. 

Let t/q = n|' 2 e ‘ #1 and t/q = n 2 2 e' ° 2 . Then 


cb/q 

8t 




dri, 86, 

-dr +i ^-df = 




dt 


I n 2 m e i ' 


drio 86 q 


(39) 


(40) 
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We multiply (39) by n\' 2 e - to obtain, with 8 = 6 2 - 0 U 


1 an, . ddi 


i 1 + Wi lir = —iT(n ] n a ) m e? s . 


We multiply (40) by n^ 2 e to obtain 


1 d^2 , . ^^2 .-T-, 0/2 -if) 

2~df +m2 lH = ~ lT(n ^ ) 6 


(41) 


(42) 


Now equate the real and imaginary' parts of (41) and similarly of (42): 


a«j 

dt 


^ = 2T(n 1 n 2 ) I/2 sinfi ; ~ - -27’(n 1 n 2 ) jy2 sin 8 ; 


nr 


^1 T ( n 2 i ~ — a 

-^-= “Mir) cos ® ; — 


dt 

d0 9 


dt 


-T ( — 

' tto 


1/2 


cos 8 . 


(43) 

(44) 


If n 1 = n 2 as for identical superconductors 1 and 2. we have from (44) that 


dOy _ aa 2 
~dt ~ ~dF 




(45) 


From (43) we sec that 


dn 2 _ dny 
~ ~~dt ‘ 


(46) 


The current flow from (1) to (2) is proportional to dn 2 /dt or, the same 
thing, — driy/dt . We therefore conclude from (43) that fhe current J of super- 
conductor pairs across the junction depends on the phase difference 8 as 


J =] 0 sin 8 =J Q sin (0 2 - 0,) . 


(47) 


where /„ is proportional to the transfer interaction T. The current J 0 is the 
maximum zero-voltage current that can be passed by the junction. With no 
applied voltage a dc current will flow across the junction (Fig. 24), with a value 
between J 0 and —J 0 according to the value of the phase difference 8 2 ~ &\- 
This is the dc Josephson effect. 

Ac Josephson Effect. Let a dc voltage V be applied across the junction. We 
can do this because the junction is an insulator. An electron pair experiences a 
potential energy difference qV on passing across the junction, where q = ~2e. 
We can say that a pair on one side is at potential energy — eV and a pair on the 
other side is at eV. The equations of motion that replace (38) are 

ih dijjy/dt = hT(l>. 2 — eVijiy ; ih fUji 2 /dl. = A7'i/r, + eVi/o . (48) 


We proceed as above to find in place of (41) the equation 

1 dfrq . d^i , >1/2 jg 

2 ~df + m i nr = ie ^ n ^ ir(«i«s) « ■ 


(49) 
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Voltage 


Figure 24 Current-voltage characteristic of a Josephson 
junction. Dc currents flow under zero applied voltage up 
to a critical current i c : this is the dc Josephson effect. At 
voltages above V. the junction has a finite resistance, but 
the current has an oscillatory component of frequency 
to = 2eV/h: this is the ac Josephson effect. 


This equation breaks up into the real part 

drii/dt = 2T(n 1 n 2 ) U2 sin 8 , (50) 

exactly as without the voltage V, and the imaginary part 

tid i ! tit — (eV/h) — r(n 2 / ri i) 1/2 cos 8 , (51) 

which differs from (44) by the term eV/h. 

Further, by extension of (42), 

\ ^ + in 2 ^ = ~i eVn 2 hT l - iT(n 1 n 2 ) m e~ iS , (52) 

whence 

dn 2 /dt = — 2T(r , t 1 n 2 ) ,/2 sin 8 ; (53) 

dd 2 /dt = —(eV/fi) — T(n 1 /n z ) y2 cos 8 . (54) 

From (51) and (54) with rq ss n 2 , we have 

d(d 2 ~ dj/dt = 88/dt = -2eV/h . (55) 

We see by integration of (55) that with a dc voltage across the junction the 
relative phase of the probability amplitudes varies as 

8(t) = 8(0) - (2eVt/h) . (56) 

The superconducting current is given by (47) with (56) for the phase: 

J — Jo sin [5(0) — (2eVt/h)} . 


(57) 
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The current oscillates with frequency 

co = 2 eV/h . (58) 

This is the ac Josephson effect. A dc voltage of 1 /xV produces a frequency 
of 483.6 MHz. The relation (58) says that a photon of energy hw = 2eV is 
emitted or absorbed when an electron pair crosses the barrier. By measuring 
the voltage and the frequency it is possible to obtain a very precise value 
of e/h. 

Macroscopic Quantum Interference. We saw in (24) and (26) that the 
phase difference 0 2 — around a closed circuit which encompasses a total 
magnetic flux f t> is given by 

0 2 - 0! = (2e/hc)$> . (59) 

The flux is the sum of that due to external fields and that due to currents in the 
circuit itself. 

We consider two Josephson junctions in parallel, as in Fig. 25. No voltage 
is applied. Let the phase difference between points 1 and 2 taken on a path 
through junction a be 8 a . When taken on a path through junction b, the phase 
difference is 8 h . In the absence of a magnetic field these two phases must be 
equal. 

Now let the flux pass through the interior of the circuit. We do this 
with a straight solenoid normal to the plane of the paper and lying inside the 
circuit. By (59), 8 b — 8 a = (2e/hc)<f>, or 

8b = 8 o+ 't <p ; 8 a= 8 "-~b (t> ■ (60) 

The total current is the sum of J a and J h . The current through each junc- 
tion is of the form (47), so that 

/Total =Jo jsm + Y c $ ) + sin ^ S 0 - ~ j ■ = 2(/ 0 sin 8 0 ) cos ^ . 



Figure 25 The arrangement for experiment on 
macroscopic quantum interference. A magnetic 
flux <J> passes through the interior of the loop. 


Insulator h 
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A 




Magnetic field (milligauss) 

Figure 26 Experimental trace of / mil . versus magnetic field showing interference and diffraction 
effects for two junctions A and B. The field periodicity is 39.5 and 16 mG for A and B, respec- 
tively. Approximate maximum currents are 1 mA (A) and 0.5 mA (B). The junction separation is 
3 mm and junction width 0.5 mm for both cases. The zero offset of A is due to a background mag- 
netic field. (After R. G. Jaklevic, J. Lambe, J. E. Mercereau, and A. H. Silver.) 


The current varies with <f> and has maxima when 

e<t>/hc = stt , s = integer . (61) 

The periodicity of the current is shown in Fig. 26. The short period varia- 
tion is produced by interference from the two junctions, as predicted by (61). 
The longer period variation is a diffraction effect and arises from the finite 
dimensions of each junction — this causes to depend on the particular path 
of integration (Problem 6), 


HIGH-TEMPERATURE SUPERCONDUCTORS 


High T c or IITS denotes superconductivity in materials, chiefly copper 
oxides, with high transition temperatures, accompanied by high critical cur- 
rents and magnetic fields. By 1988 the long-standing 23 K ceiling of T c in 
intermetallic compounds had been elevated to 125 K in bulk superconducting 
oxides; these passed the standard tests for superconductivity — the Meissner 
effect, ac Josephson effect, persistent currents of long duration, and substan- 
tially zero dc resistivity. Memorable steps in the advance include: 


BaPb 0 7 ;Bi 0 25 O 3 

La 1 . S5 Ba 0 _ 15 CuO 4 

YBa 2 Cu 3 0 7 

Tl 2 Ba 2 Ca 2 Cu 3 O 10 

Hgo.9Tlo.2Ba2Ca2Cu3O8.33 


T c = 12 K [BPBO] 

T c - 36 K [LBCO] 

T c = 90 K [YBCO] 

T c = 120 K [TBCO] 

T, = 138 K 



294 


SUMMARY 

(In CGS Units) 


• A superconductor exhibits infinite conductivity. 

• A bulk specimen of metal in the superconducting state exhibits perfect dia- 
magnetism, with the magnetic induction B = 0. This is the Meissner effect. 
The external magnetic field will penetrate the surface of the specimen over 
a distance determined by the penetration depth A . 

• There arc two types of superconductors, I and II. In a bulk specimen of type I 
superconductor the superconducting state is destroyed and the normal state is 
restored by application of an external magnetic field in excess of a critical value 
H c . A type II superconductor has two critical fields, H cl < H c < H r2 ; a vortex 
state exists in the range between If cl and H cl . The stabilization energy density of 
the pure superconducting state is ZZ 2 /8 -tt in both type I and II superconductors. 

• In the superconducting state an energy gap, E g = 4 k B T c . separates supercon- 
ducting electrons below from normal electrons above the gap. The gap is de- 
tected in experiments on heat capacity, infrared absorption, and tunneling. 

• Three important lengths enter the theory of superconductivity; the London 
penetration depth A L : the intrinsic coherence length £" 0 ; and the normal 
electron mean free path €. 


The London equation 


J = 


4 TT\f 


or 


curl i = — 


4ttA| 


B 


leads to the Meissner effect through tire penetration equation V 2 B = B/Af,, 
where *= (mc 2 /4iine 2 ) 1/2 is the London penetration depth. 

• In the London equation A or B should be a weighted average over the co- 
herence length £. The intrinsic coherence length £ 0 = 2hvj F )v E^. 

• The BCS theory accounts for a superconducting state formed from pairs of 
electrons kT and — k4-. These pairs act as bosons. 

• Typo II superconductors have £ < A. The critical fields are related by 
H cl ~ (£/A)H c and JI c2 = (A /£)H C . The Ginzburg-Landau parameter k is de- 
fined as A/£. 


Problems 

1. Magnetic field penetration in a plate. The penetration equation may be written 
as A 2 V 2 £ = B, where A is the penetration depth, (a) Show that B(x) inside a super- 
conducting plate perpendicular to the x axis and of thickness S is given by 


B(x) = B„ 


cosh (x/\) 
cosh (8/2A ) 
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where B a is the field outside the plate and parallel to it; here x = 0 is at the center 
of the plate, (b) The effective magnetization M(x) in the plate is defined by 
B(x) — B a = 4ttM(x). Show that, in CGS, 4 ttM(x) = — B„(1/8A 2 )(S 2 — 4x 2 ), for S < 
A. In SI we replace the 477 by p Q . 

2. Critical field of thin films, (a) Using the result of Problem lb, show that the free 
energy density at T = 0 K within a superconducting film of thickness 5 in an exter- 
nal magnetic field B a is given by, for 8 < A, 

(CGS) F s (x, B.) = U s (0) + (8 2 - 4x 2 )B 2 /64n\ 2 . 

In SI the factor tt is replaced by j p Q . We neglect a kinetic energy contribution to 
the problem, (b) Show that the magnetic contribution to F s when averaged over 
the thickness of the film is B 2 (8/A ) 2 /967 t. (c) Show that the critical field of the thin 
film is proportional to ( \/8)H c , where H c is the bulk critical field, if we consider 
only the magnetic contribution to U s . 

3. Two-fluid model of a superconductor. On the two-fluid model of a supercon- 
ductor we assume that at temperatures 0 < T < T c the current density may be 
written as the sum of the contributions of normal and superconducting electrons: 
j = j N 4- j s , where j v = tr 0 E and j s is given by the London equation. Here <r 0 is an 
ordinary normal conductivity, decreased by the reduction in the number of normal 
electrons at temperature T as compared to the normal state. Neglect inertial ef- 
fects on both j N and_/ s . (a) Show from the Maxwell equations that the dispersion re- 
lation connecting wavevector k and frequency cn for electromagnetic waves in the 
superconductor is 

(CGS) kV = 4rrcr 0 wi — c 2 A£ 2 + to 2 ; or 

(SI) Ire 2 = ((7,)/e 0 ) on — c 2 \J 2 + a> 2 ; 


where \\ is given by (14a) with n replaced by n s . Recall that curl curl B = — V 2 B. 
(b) If r is the relaxation time of the normal electrons and n N is their concentration, 
show by use of the expression a 0 = n N e 2 r/m that at frequencies o> 1/r the disper- 
sion relation does not involve the normal electrons in an important way, so that the 
motion of the electrons is described by the London equation alone. The super- 
current short-circuits the normal electrons. The London equation itself only holds 
true if fun is small in comparison with the energy gap. Note: The frequencies of 
interest are such that m at p , where oj p is the plasma frequency. 

*4. Structure of a vortex, (a) Find a solution to the London equation that has cylin- 
drical symmetry and applies outside a line core. In cylindrical polar coordinates, we 
want a solution of 


B - A 2 V 2 B = 0 


‘This problem is somewhat difficult. 
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that is singular at the origin and for which the total flux is the flux quantum: 


277 dp pB(p) ~ <t>„ . 

Jo 

The equation is in fact valid only outside the normal core of radius L. (b) Show that 
the solution has the limits 

B(p) = (A^ttA 2 ) ln(A/p) , (£<?p< A) 

B{p) — (<i V/2 7rA “) ! ttA / 2p) 1/5 exp(— p/A) . (p> A) 

5. London penetration depth, (a) Take the time derivative of the London equation 
(10) to show that dj/dt = (c 2 /47rA|)E. (b) If mdv/dt = q~E. as for free carriers of 
charge q and mass m, show that k\ = mc^Mimq 2 . 

6. Diffraction effect of Josephson junction. Consider a junction of rectangular cross 
section with a magnetic field B applied in the plane of the junction, normal to an 
edge of width w. Let the thickness of the junction be T. Assume for convenience 
that the phase difference of the two superconductors is rr! 2 when B ~ 0. Show that 
the dc current in the presence of the magnetic field is 

sin(t vTBe/hc) 
j'° (wTBe/fw) ’ 

7. Meissner effect in sphere. Consider a sphere of a type 1 superconductor with crit- 
ical field II... (a) Show that in the Meissner regime the effective magnetization M 
within the sphere is given by — 8 ttM/3 = B.. the uniform applied magnetic field, 
(b) Show' that the magnetic field at the surface of the sphere in the equatorial plane 
is 3BJ2. (It follows that the applied field at which the Meissner effect starts to break 
down is 211/3.) Reminder: The demagnetization field of a uniformly magnetized 
sphere is — 4 ttM/3. 


Reference 


An excellent superconductor review is the website supcrconductors.org. 
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NOTATION: In the problems treated in this chapter the magnetic field B is 
always closely equal to the applied field B a , so that we write B for B a in most 
instances. 






CHAPTER 11: DIAMAGNETISM AND PARAMAGNETISM 


Magnetism is inseparable from quantum mechanics, for a strictly classical 
system in thermal equilibrium can display no magnetic moment, even in a 
magnetic field. The magnetic moment of a free atom has three principal 
sources: the spin with which electrons are endowed; their orbital angular mo- 
mentum about the nucleus; and the change in the orbital moment induced by 
an applied magnetic field. 

The first two effects give paramagnetic contributions to the magnetiza- 
tion, and the third gives a diamagnetic contribution. In the ground Is state of 
the hydrogen atom the orbital moment is zero, and the magnetic moment is 
that of the electron spin along with a small induced diamagnetic moment. In 
the Is 2 state of helium the spin and orbital moments are both zero, and there is 
only an induced moment. Atoms with all filled electron shells have zero spin 
and zero orbital moment: finite moments are associated with unfilled shells. 

The magnetization M is defined as the magnetic moment per unit volume. 
The magnetic susceptibility per unit volume is defined as 

(CGS) * = §; (SI) (1) 

where B is the macroscopic magnetic field intensity. In both systems of units x 
is dimensionless. We shall sometimes for convenience refer to M/B as the sus- 
ceptibility without specifying the system of units. 

Quite frequently a susceptibility is defined referred to unit mass or to a 
mole of the substance. The molar susceptibility is written as Xm> the magnetic 
moment per gram is sometimes written as er. Substances with a negative mag- 
netic susceptibility are called diamagnetic. Substances with a positive suscep- 
tibility are called paramagnetic, as in Fig. 1. 

Ordered arrays of magnetic moments are discussed in Chapter 12; the ar- 
rays may be ferromagnetic, ferrimagnetic, antiferromagnetic, helical, or more 
complex in form. Nuclear magnetic moments give rise to nuclear paramag- 
netism. Magnetic moments of nuclei are of the order of 10~ 3 times smaller 
than the magnetic moment of the electron. 

LANGEVIN DIAMAGNETISM EQUATION 

Diamagnetism is associated with the tendency of electrical charges par- 
tially to shield the interior of a body from an applied magnetic field. In 
electromagnetism we are familiar with Lenz’s law: when the flux through an 
electrical circuit is changed, an induced (diamagnetic) current is set up in such 
a direction as to oppose the flux change. 
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In a superconductor or in an electron orbit within an atom, the induced 
current persists as long as the field is present. The magnetic field of the in- 
duced current is opposite to the applied field, and the magnetic moment asso- 
ciated with the current is a diamagnetic moment. Even in a normal metal 
there is a diamagnetic contribution from the conduction electrons, and this 
diamagnetism is not destroyed by collisions of the electrons. 

The usual treatment of the diamagnetism of atoms and ions employs the 
Larmor theorem: In a magnetic field the motion of the electrons around a 
central nucleus is, to the first order in B, the same as a possible motion in the 
absence of B except for the superposition of a precession of the electrons with 
angular frequency 

(CGS) cj = eB/2mc ; (SI) at = eB/2m. (2) 

If the field is applied slowly, the motion in the rotating reference system will 
be the same as the original motion in the rest system before the application of 
the field. 

If the average electron current around the nucleus is zero initially, the 
application of the magnetic field will cause a finite current around the nucleus. 
The current is equivalent to a magnetic moment opposite to the applied field. 
It is assumed that the Larmor frequency (2) is much lower than the frequency 
of the original motion in the central field. This condition is not satisfied in free 
carrier cyclotron resonance, and the cyclotron frequency of the carriers is 
twice the frequency (2). 

The Larmor precession of Z electrons is equivalent to an electric 
current 


(SI) 


I = (eharge)(revolutions per unit time) = ( — Ze)l 


2 IT 



(3) 


The magnetic moment p of a current loop is given by the product (cur- 
rent) X (area of the loop). The area of the loop of radius p is tt p 2 . We have 

(SI) p = -^(p2) , (CGS) p=-f^(p 2 >. (4) 

4m 4 me 2 

Here (p 2 ) = (x 2 ) + (y 2 ) is the mean square of the perpendicular distance of the 
electron from the field axis through the nucleus. The mean square distance of 
the electrons from the nucleus is (r 2 ) = (x 2 ) + (y 2 ) + (z 2 ). For a spherically 
symmetrical distribution of charge we have (x 2 ) = (y 2 ) = (% 2 ), so that 
<r 2 > = |(p 2 ). 
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From (4) the diamagnetic susceptibility per unit volume is, if N is the 
number of atoms per unit volume, 


(CGS) 



NZe 2 
6 me 2 


<r 2 > ; 


(5) 


(SI) 


_ lipNn _ iipNZe 2 2 

X B 6m V 7 


This is the classical Langevin result. 

The problem of calculating the diamagnetic susceptibility of an isolated 
atom is reduced to the calculation of (r 2 ) for the electron distribution within 
the atom. The distribution can be calculated by quantum mechanics. 

Experimental values for neutral atoms are most easily obtained for the 
inert gases. Typical experimental values of the molar susceptibilities are the 
following: 



He 

Ne 

Ar 

Kr 

Xe 

Xu in CGS in 10 6 cm 3 /mole: 

-1.9 

-7.2 

-19.4 

-28.0 

-43.0 


In dielectric solids the diamagnetic contribution of the ion cores is de- 
scribed roughly by the Langevin result. The contribution of conduction elec- 
trons in metals is more complicated, as is evident from the de Haas-van 
Alphen effect discussed in Chapter 9. 


QUANTUM THEORY OF DIAMAGNETISM OF MONONUCLEAR SYSTEMS 


We give the quantum treatment of the classical Langevin result. From 
Appendix (G.18) the effect of a magnetic field is to add to the Hamiltonian 
the terms 

o fe= Je^( V . A + A . V) + -^A 2 ; (6) 

2 mc K 2 me 2 

for an atomic electron these terms may usually be treated as a small perturba- 
tion. If the magnetic field is uniform and in the z direction, we may write 

A x = ~\yB , A tJ = $xB , A. = 0 , (7) 


and (6) becomes 


2d2 


iefiB( d d \ , e z B , 2 , ; 
2rnc \ dy ^ dx) 8t nc 2 * ^ 


( 8 ) 


The first term on the right is proportional to the orbital angular momen- 
tum component L, 2 if r is measured from the nucleus. In mononuclear systems 
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this term gives rise only to paramagnetism. The second term gives for a spheri- 
cally symmetric system a contribution 


E' 


2d2 


e z B 


AA) 


(9) 


12 me 2 

by first-order perturbation theory. The associated magnetic moment is 
diamagnetic: 

dE' e 
dB 




"HA) 


6 me 2 


B 


( 10 ) 


in agreement with the classical result (5). 


PARAMAGNETISM 

Electronic paramagnetism (positive contribution to x ) is found in: 

1. Atoms, molecules, and lattice defects possessing an odd number of 
electrons, as here the total spin of the system cannot be zero. Examples: free 
sodium atoms; gaseous nitric oxide (NO); organic free radicals such as tri- 
phenylmethyl, C(C 6 H S ) 3 ; F centers in alkali halides. 

2. Free atoms and ions with a partly filled inner shell: transition ele- 
ments; ions isoelectronic with transition elements; rare earth and actinide ele- 
ments. Examples: Mn 3 *, Gd 3+ , U 4+ . Paramagnetism is exhibited by many of 
these ions even when incorporated into solids, but not invariably. 

3. A few compounds with an even number of electrons, including molec- 
ular oxygen and organic biradicals. 

4. Metals. 


QUANTUM THEORY OF PARAMAGNETISM 


The magnetic moment of an atom or ion in free space is given by 

P = yfiJ= -g^jJ - 


(ID 


where the total angular momentum hj is the sum of the orbital ftL and spin hS 
angular momenta. 

The constant y is the ratio of the magnetic moment to the angular mo- 
mentum; y is called the gyromagnetic ratio or magnetogyric ratio. For 
electronic systems a quantity g called the g factor or the spectroscopic splitting 
factor is defined by 

g ^B^-yh . (12) 


For an electron spin g = 2.0023, usually taken as 2.00. For a free atom the 
g factor is given by the Lande equation 

J(J + 1) + S(S + 1) - L(L + 1) 
g = 1+ 


( 13 ) 
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<{ 2 p-B 



Figure 2 Energy level splitting for one elec- 
tron in a magnetic field B directed along the 
positive z axis. For an electron the magnetic 
moment fi is opposite in sign to the spin S, so 
that /x = — gju. B S. In the low-energy state the 
magnetic moment is parallel to the magnetic 
field. 



Figure 3 Fractional populations of a two-level 
system in thermal equilibrium at temperature T 
in a magnetic field B. The magnetic moment is 
proportional to the difference between the two 
curves. 


The Bohr magneton ju B is defined as eh.l2.mc in CGS and eh/2m in SI. It is 
closely equal to the spin magnetic moment of a free electron. 

The energy levels of the system in a magnetic field are 

U = -fL • B = m ; g/x B B , (14) 

where rtij is the azimuthal quantum number and has the values /, J — 1, , 

—J. For a single spin with no orbital moment we have rtij = ±| and g — 2, 
whence U = ±/a £ B. This splitting is shown in Fig. 2. 

If a system has only two levels the equilibrium populations are, with 

t = k B T, 

\\ _ exp (fi-B/r) 

N expf/tB/r) + exp ( — /xB/r) 

JV 2 _ exp(— p-B/r) 

A 7 exp(^tB/r) + exp(— jliB/t) 

here N u are the populations of the lower and upper levels, and N = N± + 
N 2 is the total number of atoms. The fractional populations are plotted in 
Fig- 3. 

The projection of the magnetic moment of the upper state along the field 
direction is —/a and of the lower state is ju. The resultant magnetization for N 
atoms per unit volume is, with x = fiB/k B T, 

M = (N 1 - N 2 )fi = Nfi • * = Nfi tanh * . (17) 

For x <•? 1, tanh x = x, and we have 


M = N/i(nB/k B T) . (18) 

In a magnetic field an atom with angular momentum quantum number J 
has 2J + 1 equally spaced energy levels. The magnetization (Fig. 4) is given by 

M = NgJ/i B Bj(x) , (x = gJpt B B/k B T) , (19) 
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Figure 4 Plot of magnetic moment versus BIT for spherical samples of (I) potassium chromium 
alum, (II) ferric ammonium alum, and (III) gadolinium sulfate octahydrate. Over 99.5% magnetic 
saturation is achieved at 1.3 K and about 50,000 gauss (5T). After W. E. Henry. 


where the Brillouin function Bj is defined by 


Bj(x) = 


2 /+ 1 

~W~ 


ctnh 






Equation (17) is a special case of (20) for J = §. 

For x = p, B/k B T < 1, we have 

ctnhx = | + |-|g+--- , 

and the susceptibility' is 


M ^ NJ(J + l)gVfi _ IVpVi _ C 
B 3k B T 3k B T T ' 


Here p is the effective number of Bohr magnetons, defined as 


p=g[/(/+Dr . 


( 20 ) 


( 21 ) 


( 22 ) 


(23) 
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Figure 5 Plot of \!\ vs T for a gadolinium salt, Gd(C2H 3 S0 4 )^ * 9H 2 0. The straight line is the 
Curie law. (After L. C. Jackson and H. Kamerlingh Onnes.) 


The constant C is known as the Curie constant. The form (19) is known as 
the Curie-Brillouin law, and (22) is known as the Curie law. Results for the 
paramagnetic ions in a gadolinium salt are shown in Fig. 5. 

Rare Earth Ions 

The ions of the rare earth elements (Table 1) have closely similar chemical 
properties, and their chemical separation in tolerably pure form was accom- 
plished only long after their discovery'. Their magnetic properties are fascinating: 
The ions exhibit a systematic variety and intelligible complexity. The chemical 
properties of the trivalent ions are similar because the outermost electron 
shells are identically in the 5s 2 5p s configuration, like neutral xenon. In lan- 
thanum, just before the rare earth group begins, the 4 f shell is empty; at 
cerium there is one 4 f electron, and the number of 4 f electrons increases 
steadily through the group until we have 4f 13 at ytterbium and the filled shell 
4 f u at lutecium. The radii of the trivalent ions contract fairly smoothly as we 
go through the group from 1.1 1 A at cerium to 0.94 A at ytterbium. This is 
known as the “lanthanide contraction.” What distinguishes the magnetic be- 
havior of one ion species from another is the number of 4 f electrons com- 
pacted in the inner shell with a radius of perhaps 0.3 A. Even in the metals the 
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Table 1 Effective magneton numbers p for trivalent lanthanide group ions 


(Near room temperature) 

Ion 

Configuration 

Basic level 

p(cale) = 

glJ(J + 1)1 1/2 

p(exp), 

approximate 

Ce 3+ 

4f 1 5s 2 p 6 


2.54 

2.4 

Pr 3+ 

4f5 S y 

y 

3.58 

3.5 

Nd 3+ 

4/ 3 5s 2 p 6 

4 T 

i 9/2 

3.62 

3.5 

Pm 3 * 

4f'5s 2 p 6 

% 

2.68 

— 

Sm !+ 

4f5sY 

y 2 

0.84 

1.5 

Eu 3+ 

4f5sY 

7 F n 

0 

3.4 

Gd 31 

4f 7 5sY 

8 ^7/2 

7.94 

8.0 

Tb 3+ 

4f5sY 

7 F 6 

9.72 

9.5 

Dy 3+ 

4/ 9 5.?y 

8 w 1s/2 

10.63 

10.6 

Ho ,+ 

4/ iw 5. s y 

5 r 

l s 

10.60 

10.4 

Er 3+ 

4f u 5 S y 

4 I\S/2 

9.59 

9.5 

Tm 3+ 

4f*5sY 


7.57 

7.3 

Yb' ,+ 

4/ 13 5$y 


4.54 

4.5 


4 If core retains its integrity and its atomic properties: no other group of ele- 
ments in the periodic table is as interesting. 

The preceding discussion of paramagnetism applies to atoms that have a 
(2 J + l)-fold degenerate ground state, the degeneracy being lifted by a mag- 
netic field. The influence of all higher energy states of the system is neglected. 
These assumptions appear to be satisfied by a number of rare-earth ions. 
Table 1. The calculated magneton numbers are obtained with g values from 
the Landc result (13) and the ground-state level assignment predicted below 
by the Hund theory of spectral terms. The discrepancy between the experi- 
mental magneton numbers and those calculated on these assumptions is quite 
marked for Eu s+ and Sm 3+ ions. For these ions it is necessary to consider the 
influence of the high states of the L — S multiplet, as the intervals between 
successive states of the multiplet are not large compared to h B T at room tem- 
perature. A multiplet is the set of levels of different J values arising out of a 
given L and S. The levels of a multiplet are split by the spin-orbit interaction. 

Hund Rules 

The Hund rules as applied to electrons in a given shell of an atom affirm 
that electrons will occupy orbitals in such a way that the ground state is char- 
acterized by the following: 

1. The maximum value of the total spin S allowed by the exclusion principle; 

2. The maximum value of the orbital angular momentum L consistent 
with this value of S; 
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3. The value of the total angular momentum / is equal to I L — SI when the 
shell is less than half full and to L + S when the shell is more than half full. When 
the shell is just half full, the application of the first rule gives L = 0, so that/ = S. 

The first Hund rule has its origin in the exclusion principle and the coulomb 
repulsion between electrons. The exclusion principle prevents two electrons of 
the same spin from being at the same place at the same time. Thus electrons of 
the same spin are kept apart, further apart than electrons of opposite spin. Be- 
cause of the coulomb interaction the energy of electrons of the same spin is 
lower — the average potential energy is less positive for parallel spin than for 
antiparallel spin. A good example is the ion Mn 2+ . This ion has five electrons in 
the 3d shell, which is therefore half-filled. The spins can all be parallel if each 
electron enters a different orbital, and there are exactly five different orbitals 
available, characterized by the orbital quantum numbers rn L = 2, 1, 0, —1, —2. 
These will be occupied by one electron each. We expect S - and because 
Xm L = 0 the only possible value of L is 0, as observed. 

The second Hund rule is best approached by model calculations. Pauling 
and Wilson, 1 for example, give a calculation of the spectral terms that arise 
from the configuration p 2 . The third Hund rule is a consequence of the sign of 
the spin-orbit interaction: For a single electron the energy is lowest when the 
spin is antiparallel to the orbital angular momentum. But the low-energy pairs 
m L , m s are progressively used up as we add electrons to the shell; by the exclu- 
sion principle when the shell is more than half full the state of lowest energy 
necessarily has the spin parallel to the orbit. 

Consider two examples of the Hund rules: The ion Cc 3+ has a single/ 
electron; an/ electron has l = 3 and s — |. Because the/ shell is less than half 
full, the / value by the preceding rule is IL — SI = L — l = f. The ion Pr 3+ has 
two / electrons; one of the rules tells us that the spins add to give S — 1. Both/ 
electrons cannot have m l = 3 without violating the Pauli exclusion principle, so 
that the maximum L consistent with the Pauli principle is not 6, but 5. The / 
value is IL — SI = 5 — 1 = 4. 

Iron Group Ions 

Table 2 show's that the experimental magneton numbers for salts of the 
iron transition group of the periodic table are in poor agreement with (23). 
The values often agree quite well with magneton numbers p = 2[S(S + 1 )] 1/2 
calculated as if the orbital moment were not there at all. 

Crystal Field Splitting 

The difference in behavior of the rare earth and the iron group salts is that 
the 4/ shell responsible for paramagnetism in the rare earth ions lies deep 

*L, Pauling and E. B, Wilson. Introduction to quantum mechanics, McGraw-Hill, 1935, 
pp. 239—246. See also Dover Reprint. 
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Table 2 Effective magneton numbers for iron group ions 


Ion 

Configuration 

Basic 

level 

p(calc) = 

g \j(] + di 1/2 

p(calc) = 
2[S(S + l)] 192 

p(cxp)“ 

Ti 3+ , V 4+ 

3d 1 

~D m 

1.55 

1.73 

1.8 

y3 + 

3 d 2 

3 Fo 

1.63 

2. S3 

2.8 

Cr 3+ , V 2+ 

3d 3 


0.77 

3.87 

3.8 

Mn ,1+ , Cr 2+ 

3d 4 

5 d 

u o 

0 

4.90 

4.9 

Fe 3 \ Mn 2+ 

3d 5 

6 S.,/2 

5.92 

5.92 

5.9 

Fe 2+ 

3d 6 

5 d 4 

6.70 

4.90 

5.4 

Co 2+ 

3d‘ 

"F 9 ;o 

6.63 

3.87 

4.8 

Ni 2+ 

3d s 


5.59 

2.83 

3.2 

Ci. 2+ 

,3d 9 

2 A>/2 

3.55 

1.73 

1.9 


“Representative values. 


inside the ions, within the 5s and 5 p shells, whereas in the iron group ions the 
3 d shell responsible for paramagnetism is the outermost shell. The 3 d shell ex- 
periences the intense inhomogeneous electric field produced by neighboring 
ions. This inhomogeneous electric field is called the crystal field. The inter- 
action of the paramagnetic ions with the crystal field has two major effects: 
The coupling of L and S vectors is largely broken up, so that the states are no 
longer specified by their / values; further, the 2L + 1 sublevels belonging to a 
given L which are degenerate in the free ion may now' be split by the crystal 
field, as in Fig. 6. This splitting diminishes the contribution of the orbital mo- 
tion to the magnetic moment. 

Quenching of the Orbital Angular Momentum 

In an electric field directed toward a fixed nucleus, the plane of a classical 
orbit is fixed in space, so that all the orbital angular momentum components 
L x , L ;i , L, are constant. In quantum theory one angular momentum compo- 
nent, usually taken as L z , and the square of the total orbital angular momen- 
tum L 2 are constant in a central field. In a noncentral field the plane of the 
orbit will move about; the angular momentum components are no longer con- 
stant and may average to zero. In a crystal L, will no longer be a constant of 
the motion, although to a good approximation L 2 may continue to be constant. 
When L z averages to zero, the orbital angular momentum is said to be 
quenched. The magnetic moment of a state is given by the average value of the 
magnetic moment operator /a b (L + 2S). In a magnetic field along the z direc- 
tion the orbital contribution to the magnetic moment is proportional to the 
quantum expectation value of L z ; the orbital magnetic moment is quenched if 
the mechanical moment L z is quenched. 
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Figure 6 Consider an atom with orbital angular momentum L = 1 placed in the uniaxial crys- 
talline electric field of the two positive ions along the z axis. In the free atom the states m L = ±1,0 
have identical energies — they are degenerate. In the crystal the atom has a lower energy when the 
electron cloud is close to positive ions as in (a) than when it is oriented midway between them, as 
in (b) and (c). The wavefunctions that give rise to these charge densities are of the form zf(r), xf(r) 
and yf(r) and are called the p., p s , p y orbitals, respectively. In an axially symmetric field, as shown, 
the p x and p y orbitals are degenerate. The energy levels referred to the free atom (dotted line) are 
shown in (d). If the electric field does not have axial symmetry, all three states will have different 
energies. 


As an example, consider a single electron with orbital quantum number 
L — 1 moving about a nucleus, the whole being placed in an inhomogeneous 
crystalline electric field. We omit electron spin. 

In a crystal of orthorhombic symmetry the charges on neighboring ions 
will produce an electrostatic potential <p about the nucleus of the form 

e<p = Ax 2 + By 2 — (A + B)z 2 , (24) 

where A and B are constants. This expression is the lowest degree polynomial 
in x, y, z which is a solution of the Laplace equation V 2 <p = 0 and compatible 
with the symmetry of the crystal. 

In free space the ground state is three-fold degenerate, with magnetic 
quantum numbers m L = 1, 0, —1. In a magnetic field these levels are split by 
energies proportional to the field B, and it is this field-proportional splitting 
which is responsible for the normal paramagnetic susceptibility of the ion. In 
the crystal the picture may be different. We take as the three wavefunctions 
associated with the unperturbed ground state of the ion 

U x = xf(r) ; U y = yf(r) ; U, = zf(r) . (25) 
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These wavefunctions are orthogonal, and we assume that they are normalized. 
Each of the ITs can be shown to have the property 

2*17, = L(L + l)C7j = 2 U, , (26) 

where if 2 is the operator for the square of the orbital angular momentum, in 
units of ti. The result (26) confirms that the selected wavefunctions are in fact 
p functions, having L = 1 . 


We observe now that the If s are diagonal with respect to the perturbation, 
as by symmetry the nondiagonal elements vanish: 

(U x \e<p\U y ) = (U x \e<p\U z ) = {U y \e<p\l].) = 0 . (27) 

Consider for example, 

(U x \e<p\U y ) = S xy\f(r)\ 2 {Ax z + By 2 — (A + B)z 2 } dxdy dz ■ (28) 


the integrand is an odd function of x (and also of y ) and therefore the integral 
must be zero. The energy levels are then given by the diagonal matrix 
elements: 


where 


(U x \e<p\U x ) = f |/(r) \ 2 {Ax 4 + Bifx 2 — (A + B)z 2 x 2 } dx dy dz 
= A(/,-/ 2 ) , 


(29) 


— f | fir) \ 2 x 4 dx dy dz ; l 2 = f |/(r) 1 2 x 2 y 2 dx dy dz . 

In addition, 


(U 9 \e<p\U v ) = B(I, - h) ; (U z \e<p\U z ) - -(A + B)(7 1 - Z 2 ) . 

The three eigenstates in the crystal field are p functions with their angular 
lobes directed along each of the x, y, z axes, respectively. 

The orbital moment of each of the levels is zero, because 


<L7 I |L,|t7 I > = {U y \L z \U y ) = (I7jLjUj = 0 . 


The level still has a definite total angular momentum, since X 2 is diagonal and 
gives L = 1, but the spatial components of the angular momentum are not 
constants of the motion and their time average is zero in the first approxima- 
tion. Therefore the components of the orbital magnetic moment also vanish in 
the same approximation. The role of the crystal field in the quenching process 
is to split the originally degenerate levels into nonmagnetic levels separated by 
energies p.H, so that the magnetic field is a small perturbation in compari- 
son with the crvstal field. 

✓ 

At a lattice site of cubic symmetry there is no term in the potential of the 
form (24), that is, quadratic in the electron coordinates. Now the ground state 
of an ion with one p electron (or with one hole in a p shell) will be triply 
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degenerate. However, the energy of the ion will be lowered if the ion displaces 
itself with respect to the surroundings, thereby creating a noncubic potential 
such as (24). Such a spontaneous displacement is known as a Jahn-Teller 
effect and is often large and important, particularly with the Mn 3+ and Cu 2+ 
ions and with holes in alkali and silver halides. 

Spectroscopic Splitting Factor 

We suppose for convenience that the crystal field constants. A, B are such 
that U x — xf(r) is the orbital wave function of the ground state of the atom in 
the crystal. For a spin S = \ there are two possible spin states S z — ±| repre- 
sented by the spin functions a, /3, which in the absence of a magnetic field are 
degenerate in the zeroth approximation. The problem is to take into account 
the spin-orbit interaction energy AL ■ S. 

If the ground state function is i// 0 = U x a = xf(r)a in the zeroth approxima- 
tion, then in the first approximation, considering the AL • S interaction by 
standard perturbation theory, we have 

if/ = [U x - i(A/2A 1 )L/ J ]a - i(\/2A 2 )UJ3 , (30) 

where A, is the energy difference between the U x and U y states, and A 2 is the 
difference between the U x and U. states. The term in UJ3 actually has only a 
second-order effect on the result and may be discarded. The expectation value 
of the orbital angular momentum to the first order is given directly by 

(if/\L z \if/) = -A/A, , 

and the magnetic moment of the state as measured in the z direction is 
fXg(if/\ L, T 2S. |(/») = [ — (A/Aj) + l]/ig . 

As the separation between the levels S. = ±| in a field H is 
A£ = gp B H = 2[1 - (A/A,) Wf , 

the g value or spectroscopic splitting factor (12) in the z direction is 

g = 2[1 - (A/A,)] . (31) 

Van Vleck Temperature-Independent Paramagnetism 

We consider an atomic or molecular system which has no magnetic mo- 
ment in the ground state, by which we mean that the diagonal matrix element 
of the magnetic moment operator /a. is zero. 

Suppose that there is a nondiagonal matrix element (s|ju,|0) of the mag- 
netic moment operator, connecting the ground state 0 with the excited state s 
of energy A = E s — E 0 above the ground state. Then by standard perturbation 
theory the wavefunction of the ground state in a weak field (p 2 B < A) becomes 

#b = *o + (B/A)<sk|0>*, , (32) 
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and the wavefunction of the excited state becomes 

<14 = >K~ (B/AXOUIs)^ . (33) 

The perturbed ground state now has a moment 

<OV 3 |0'> = 2fi|<s|A4 z |0)| 2 /A , (34) 

and the upper state has a moment 

- -2B|<s|^|0)| 2 /A . (35) 

There are two interesting cases to consider; 

Case (a). A k B T. The surplus population in the ground state over the 
excited state is approximately equal to NA/2k B T, so that the resultant magneti- 
zation is 


M = 


2B]<s|^[0)P 

A 


NA 
2k B T ’ 


which gives for the susceptibility 


y-iVKs|jaJO)| 2 /fc B T . 


(36) 

(37) 


Here N is the number of molecules per unit volume. This contribution is of 
the usual Curie form, although the mechanism of magnetization here is by po- 
larization of the states of the system, whereas with free spins the mechanism of 
magnetization is the redistribution of ions among the spin states. We note that 
the splitting A does not enter in (37). 

Case (b). A > k B T. Here ihe population is nearly all in the ground state, 
so that 

2JVB|(s|p.,|0>| 2 , , 

M = . (38) 


The susceptibility is 


X = 


2'V|(s|/T-jO)| a 

A 


(39) 


independent of temperature. This type of contribution is known as Van Vleck 
paramagnetism. 


COOLING BY ISENTROFIC DEMAGNETIZATION 

The first method for attaining temperatures much below 1 K was that of 
isentropic, or adiabatic, demagnetization of a paramagnetic salt. By its use, 
temperatures of 10~ 3 K and lower have been reached. The method rests on the 
fact that at a fixed temperature the entropy of a system of magnetic moments 
is lowered by the application of a magnetic field. 
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Before j New equilibrium 


Time at which 
magnetic field 
is removed 



Before | New equilibrium 


Time at which 
magnetic field 
is removed 


Figure 7 During isentropic demagnetization the total entropy of the specimen is constant. For 
effective cooling the initial entropy of the lattice should be small in comparison with the entropy 
of the spin system. 


The entropy is a measure of the disorder of a system: the greater the dis- 
order, the higher is the entropy. In the magnetic field the moments will be 
partly lined up (partly ordered), so that the entropy is lowered by the field. 
The entropy is also lowered if the temperature is lowered, as more of the mo- 
ments line up. 

If the magnetic field can then be removed without changing the entropy 
of the spin system, the order of the spin system will look like a lower tempera- 
ture than the same degree of order in the presence of the field. When the 
specimen is demagnetized at constant entropy, entropy can flow into the spin 
system only from the system of lattice vibrations, as in Fig. 7. At the tempera- 
tures of interest the entropy of the lattice vibrations is usually negligible, thus 
the entropy of the spin system will be essentially constant during isentropic 
demagnetization of the specimen. Magnetic cooling is a one-shot operation, 
not cyclic. 

We first find an expression for the spin entropy of a system of IV ions, each 
of spin S, at a temperature sufficiently high that the spin system is entirely dis- 
ordered. That is, T is supposed to he much higher than some temperature A 
which characterizes the energy of the interactions (£ int = k B A) tending to 
orient the spins preferentially. Some of these interactions are discussed in 
Chapter 12. The definition of the entropy a of a system of G accessible states 
is tr = k B In G. At a temperature so high that all of the 2S + 1 states of each 
ion are nearly equally populated, G is the number of ways of arranging N spins 
in 2S + 1 states. Thus G = (2S + l) w , whence the spin entropy (T s is: 

(7g = k B In (2S + if = Nk s In (2S + 1) . 


(40) 
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Figure 8 Entropy for a spin f system as a function of temperature, assuming an internal random 
magnetic field of 100 gauss. The specimen is magnetized isothermally along ab, and is then 
insulated thermally. The external magnetic field is turned off along be. In order to keep the figure 
on a reasonable scale the initial temperature Ti is lower than would be used in practice, and so is 
the external magnetic field. 


This spin entropy is reduced by a magnetic field if the lower levels gain in 
population when the field separates the 2S + 1 states in energy. 

The steps carried out in the cooling process are shown in Fig. 8. The field 
is applied at temperature with the specimen in good thermal contact with 
the surroundings, giving the isothermal path ab. The specimen is then insu- 
lated (Act = 0) and the field removed, the specimen follows the constant en- 
tropy path be , ending up at temperature T 2 . The thermal contact at T 1 is pro- 
vided by helium gas, and the thermal contact is broken by removing the gas 
with a pump. 

The population of a magnetic sublevel is a function only of / iB/k B T , hence 
of BIT. The spin-system entropy is a function only of the population distribu- 
tion; hence the spin entropy is a function only of BIT. If B A is the effective 
field that corresponds to the local interactions, the final temperature T 2 
reached in an isentropic demagnetization experiment is 


T 2 = UBJB) , 


(41) 


where B is the initial field and 7’, the initial temperature. 


Nuclear Demagnetization 

Because nuclear magnetic moments are weak, nuclear magnetic interac- 
tions are much weaker than similar electronic interactions. We expect to reach 
a temperature 100 times lower with a nuclear paramagnet than with an elec- 
tron paramagnet. The initial temperature T 1 of the nuclear stage in a nuclear 
spin-cooling experiment must be lower than in an electron spin-cooling 
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Initial magnetic field in lcG 



Figure 9 Nuclear demagnetizations of copper nuclei in the metal, starting from 0.012 K and 
various fields. (After M. V. Hobden and N. Kuril.) 


experiment. If we start at B = 50 kG and T 1 = 0.01 K, then /j,B/k B Ti = 0.5, 
and the entropy decrease on magnetization is over 10 percent of the maximum 
spin entropy. This is sufficient to overwhelm the lattice and from (41) we esti- 
mate a final temperature T 2 = 10“' K. The first nuclear cooling experiment 
was carried out on Cu nuclei in the metal, starting from a first stage at about 
0.02 K as attained by electronic cooling. The lowest temperature reached was 
1.2 X 10~ 6 K. 

The results in Fig. 9 fit a line of the form of (41): T 2 = Tj^.l/B) with B in 
gauss, so that B A — 3.1 gauss. This is the effective interaction field of the 
magnetic moments of the Cu nuclei. The motivation for using nuclei in a metal 
is that conduction electrons help ensure rapid thermal contact of lattice and 
nuclei at the temperature of the first stage. 


PARAMAGNETIC SUSCEPTIBILITY OF CONDUCTION ELECTRONS 

We are going to try to show how on the basis of these statistics the fact that 
many metals arc diamagnetic, or only weakly paramagnetic, can be brought into 
agreement with the existence of a magnetic moment of the electrons. 

W. Pauli, 1927 

Classical free electron theory gives an unsatisfactory account of the para- 
magnetic susceptibility of the conduction electrons. An electron has associated 
with it a magnetic moment of one Bohr magneton, p. B . One might expect that 
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the conduction electrons would make a Curie-type paramagnetic contribution 
(22) to the magnetization of the metal: M = Njx\B/k B T. Instead it is observed 
that the magnetization of most normal nonferromagnetic metals is indepen- 
dent of temperature. 

Pauli showed that the application of the Fermi-Dirac distribution (Chapter 6) 
would correct the theory as required. We first give a qualitative explanation. 
The result (18) tells us that the probability an atom will be lined up parallel to 
the field B exceeds the probability of the antiparallel orientation by roughly 
IJ,B/k B T. For N atoms per unit volume, this gives a net magnetization 
=IVjU, 2 B/k B T, the standard result. 

Most conduction electrons in a metal, however, have no possibility of turning 
over when a field is applied, because most orbitals in the Fermi sea with parallel 
spin are already occupied. Only the electrons within a range k B T of the top of the 
Fermi distribution have a chance to turn over in the field; thus only the fraction 
T/T F of the total number of electrons contribute to the susceptibility. Hence 


Nil 2 B 




k B T 


_T 

T f 


k B T F 


(42) 


which is independent of temperature and of the observed order of 
magnitude. 

We now calculate the expression for the paramagnetic susceptibility of a 
free electron gas at T < T F . We follow the method of calculation suggested by 
Fig. 10. An alternate derivation is the subject of Problem 5. 


Total energy, kinetic + 
magnetic, of electrons 



Density of 
orbitals 


Figure 10 Pauli paramagnetism at absolute zero; the orbitals in the shaded regions in (a) are 
occupied. The numbers of electrons in the “up” and “down” band will adjust to make the energies 
equal at the Fermi level. The chemical potential (Fermi level) of the moment up electrons is equal 
to that of the moment down electrons. In (b) we show the excess of moment up electrons in the 
magnetic field. 
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The concentration of electrons with magnetic moments parallel to the 
magnetic field is 

= \ J * ^deDie + fjtB) s I JJ F de D(e) +^B D(e F ) , (43) 

written for absolute zero. Here |D(e + fiB) is the density of orbitals of one 
spin orientation, with allowance for the downward shift of energy by —/xB. 
The approximation is written for k B T < e F . 

The concentration of electrons with magnetic moments antiparallel to the 
magnetic field is 

Al_ = |J^d6D(e- /i B) a \ f‘ F de D(«) - | fiB D(e r ) . (44) 

The magnetization is given by M = /x(N+ — A T _), so that 

M = M 2 D(e F )fi=|^-B , (45) 


with D(e F ) — 3N/2e F — 3N/2k B T F from Chapter 6. The result (45) gives the 
Pauli spin magnetization of the conduction electrons, for k ti T < e F . 

In deriving the paramagnetic susceptibility, we have supposed that 
the spatial motion of the electrons is not affected by the magnetic field. But 
the wavefu notions are modified by the magnetic field; Landau has shown that 
for free electrons this causes a diamagnetic moment equal to — | of the para- 
magnetic moment. Thus the total magnetization of a free electron gas is 


M = 


nA 

k B T F 


B . 


(46) 


Before comparing (46) with the experiment we must take account of the 
diamagnetism of the ionic cores, of band effects, and of electron-electron in- 
teractions. In sodium the interaction effects increase the spin susceptibility by 
perhaps 75 percent. 

The magnetic susceptibility is considerably higher for most transition 
metals (with unfilled inner electron shells) than for the alkali metals. The high 
values suggest that the density of orbitals is unusually high for transition met- 
als, in agreement with measurements of the electronic heat capacity. We saw 
in Chapter 9 how this arises from band theory. 


SUMMARY 

(In CGS Units) 

• The diamagnetic susceptibility of N atoms of atomic number Z is x = 
— Z^A^r^Armc 2 , where (r 2 ) is the mean square atomic radius. (Langevin) 

• Atoms with a permanent magnetic moment fi have a paramagnetic suscepti- 
bility a — NfjL 2 /3k B T, for /xB < k B T. (Curie-Langevin) 
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•For a system of spins S = §, the exact magnetization is M — 
Nfi tanh (fiB/k e T), where /jl = (Brillouin) 

• The ground state of electrons in the same shell have the maximum value of 
S allowed by the Pauli principle and the maximum L consistent with this S. 
The J value is L + S if the shell is more than half full and 1L — Si if the shell 
is less than half full. 

• A cooling process operates by demagnetization of a paramagnetic salt at 
constant entropy. The final temperature reached is of the order of 
{BJB)T m |. ia ], where B 4 is the effective local field and B is the initial applied 
magnetic field. 

• The paramagnetic susceptibility of a Fermi gas of conduction electrons is 
X = 3Nfi 2 /2e F , independent of temperature for k v T < e F . (Pauli) 


Problems 

1. Diamagnetic susceptibility of atomic hydrogen. The wave function of the 

hydrogen atom in its ground state (Is) is t)/ = exp(—r/a„), where 

a (j = h 2 /me 2 = 0.529 X 10 _s cm. The charge density is p(x, y, z) = — <i<//l 2 , according 
to the statistical interpretation of the wave function. Show that for this state 
(r 2 ) = 3/7, i and calculate the molar diamagnetic susceptibility of atomic hydrogen 
(-2.36 X 10 6 cmVmole). 

2. Hund rules. Apply the Hund rules to find the ground state (the basic level in the 
notation of Table 1) of (a) Eu 1 ‘ + , in the configuration 4 f~ 5 s' 2 p 6 ; (b) Yb 3+ ; (c) Tb 3+ . 
The results for (b) and (c) are in Table 1, but you should give the separate steps in 
applying the rules. 

3. Triplet excited states. Some organic molecules have a triplet (S = 1) excited state 
at an energy k B A above a singlet (S = 0) ground state, (a) Find an expression for the 
magnetic moment (fz) in a field B. (b) Show that the susceptibility for T > A is ap- 
proximately independent of A. (c) With the help of a diagram of energy levels versus 
field and a rough sketch of entropy versus field, explain how this system might be 
cooled by isentropic magnetization (not demagnetization). 

4. Heat capacity from internal degrees of freedom, (a) Consider a two-level system 
with an energy splitting k E A between upper and lower states; the splitting may arise 
from a magnetic field or in other ways. Show that the heat capacity per system is 

r _(du\ (A rrfe* 1 

Ut/a s (l+ e i/r ) 2 ' 

The function is plotted in Fig. 11. Peaks of this type in the heat capacity are often 
known as Schottky anomalies. The maximum heat capacity is quite high, but for 
T < A and for T > A the heat capacity is low. (b) Show that for T > A we have 
C = fc B (A/2T) 2 + . . . . The hyperfine interaction between nuclear and electronic mag- 
netic moments in paramagnetic salts (and in systems having electron spin order) causes 
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Figure 11 Heat capacity of a two-level system as a function of 77 A, where A is the level splitting. 
The Schottky anomaly in the heat capacity is a very useful tool for determining energy level split- 
tings of ions in rare-earth and transition-group metals, compounds, and alloys. 



Figure 12 The normal-state heat capacity of gallium 
at T < 0.21 K. The nuclear quadrupole (C T~ 2 ) and 
conduction electron (C T) contributions dominate 
the heat capacity at very low temperatures. (After 
N. E. Phillips.) 


splittings with A ~ 1 to 100 mK. These splittings are often detected experimentally by 
the presence of a term in l/T 2 in the heat capacity in the region T > A. Nuclear electric 
quadrupole interactions with crystal fields also cause splittings, as in Fig. 12. 

5. Pauli spin susceptibility. The spin susceptibility of a conduction electron gas at 
alsolute zero may be approached by another method. Let 


N + = iV(l + f) ; 







be the concentrations of spin-up and spin-down electrons, (a) Show that in a mag- 
netic field B the total energy of the spin-up band in a free electron gas is 

E' = E„(l + f) M - + 0 , 

where E 0 = in terms of the Fermi energy e F in zero magnetic field. Find a 

similar expression for E~. (b) Minimize E tota ] = E~ — E~ with respect to l and solve 
for the equilibrium value of £ in the approximation C < 1. Go on to show that the 
magnetization is M = 3Nfi 2 B/2e F , in agreement with Eq. (45). 

6. Conduction electron ferromagnetism. We approximate the effect of exchange 
interactions among the conduction electrons if we assume that electrons with paral- 
lel spins interact with each other with energy —V, and V is positive, while electrons 
with antiparallel spins do not interact with each other, (a) Show with the help of 
Problem 5 that the total energy of the spin-up band is 


E + = E 0 (l + 0™ ~ |V-V 2 (1 + - |A>B(1 + C) ; 


find a similar expression for E . (b) Minimize the total energy and solve for l in the 
limit f < 1. Show that the magnetization is 


M = 


av> 2 .. 

2e F -l VN ’ 


so that the exchange interaction enhances the susceptibility, (c) Show that with 
B = 0 the total energy is unstable at = 0 when V > 4c f -/3-V. If this is satisfied, a 
ferromagnetic state (£ # 0) will have a lower energy than the paramagnetic state. 
Because of the assumption f 1, this is a sufficient condition for ferromagnetism, 
hut it may not be a necessary' condition, ft is known as the Stoner condition. 


7. Two-level system. The result of Problem 4 is often seen in another form, (a) If the 
two energy levels are at A and —A, show that the energy and heat capacity are 


U = —A tanhtA/fcijT) ; C = k h {kJk B Tf sech^A/ftjjT) . 

(b) If the system has a random composition such that all values of A arc equally 
likely up to some limit A 0> show that the heat capacity is linearly proportional to the 
temperature, provided k B T A 0 . This result was applied to the heat capacity of di- 
lute magnetic alloys by W. Marshall, Phys. Rev. 118, 1519 (1960). It is also used in 
the theory of glasses. 

8. Paramagnetism of S = 1 system, (a) Find the magnetization as a function of 
magnetic field and temperature for a system of spins with S — 1, moment ft. and 
concentration n. (b) Show that in the limit fiB kT the result is M = {2.nft l !?,kT)B . 
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CHAPTER 12 : FERROMAGNETISM AND ANTIFERROMAGNETISM 


FERROMAGNETIC ORDER 

A ferromagnet has a spontaneous magnetic moment — a magnetic moment 
even in zero applied magnetic field. The existence of a spontaneous moment 
suggests that electron spins and magnetic moments are arranged in a regular 
manner. The order need not be simple: all of the spin arrangements sketched 
in Fig. 1 except the simple antiferromagnet have a spontaneous magnetic 
moment, called the saturation moment, 

Curie Point and the Exchange Integral 

Consider a paramagnet with a concentration of N ions of spin S. Given an 
internal interaction tending to line up the magnetic moments parallel to each 
other, we shall have a ferromagnet. Let us postulate such an interaction and call 
it the exchange field . 1 * The orienting effect of the exchange field is opposed by 
thermal agitation, and at elevated temperatures the spin order is destroyed. 

We treat the exchange field as equivalent to a magnetic field B ; . The mag- 
nitude of the exchange field may be as high as 10 7 gauss (10 3 tesla). We assume 
that B e is proportional to the magnetization M. 

The magnetization is defined as the magnetic moment per unit volume; 
unless otherwise specified it is understood to be the value in thermal equilib- 
rium at the temperature T. If domains (regions magnetized in different direc- 
tions) are present, the magnetization refers to the value within a domain. 

In the mean-field approximation we assume each magnetic atom expe- 
riences a field proportional to the magnetization: 

Bj, = AM , (1) 

where A is a constant, independent of temperature. According to (1), each spin 
sees the average magnetization of all the other spins. In truth, it may sec only 
near neighbors, but our simplification is good for a first look at the problem. 

The Curie temperature T c is the temperature above which the sponta- 
neous magnetization vanishes; it separates the disordered paramagnetic phase 
at T > T c from the ordered ferromagnetic phase at T < T c . We can find T r . in 
terms of the constant A in (1). 


1 Also called the molecular field or the Weiss field, after Pierre Weiss who was the first to 

imagine such a field. The exchange field B t simulates a real magnetic field in the expressions for 

the energy — fi • B E and the torque jx X B E on a magnetic moment /a. But B E is not really a 

magnetic field and therefore does not enter into the Maxwell equations; for example, there is no 

current density j related to B K by curl H = 47rj/c. The magnitude of B e is typically 10 4 * * larger than 

the average magnetic field of the magnetic dipoles of the ferromagnet. 
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Consider the paramagnetic phase: an applied field B a will cause a finite 
magnetization and this in turn will cause a finite exchange field B E . If x P is the 
paramagnetic susceptibility, 

(CCS) M = x P (B a + B E ); (SI) = X,,(B a + B h ) . (2) 

The magnetization is equal to a constant susceptibility times a field only if the 
fractional alignment is small: this is where the assumption enters that the spec- 
imen is in the paramagnetic phase. 

The paramagnetic susceptibility (Chapt. 11) is given by the Curie law 
X p = C/T, where C is the Curie constant. Substitute (1) in (2); we find 
MT = C(B a + AM) and 

(CGS) x = f. = (T^cXj ■ (3) 

The susceptibility (3) has a singularity at T — CA. At this temperature (and 
below) there exists a spontaneous magnetization, because if x is infinite we 
can have a finite M for zero B a . From (3) we have the Curie-Weiss law 

(CCS) X = jSf ; T c = C\ . (4) 

This expression describes fairly well the observed susceptibility variation in 
the paramagnetic region above the Curie point. The reciprocal susceptibility 
of nickel is plotted in Fig. 2. 



350 400 450 500 

Temperature in °C 


Figure 2 Reciprocal of the susceptibility per gram of nickel in the neighborhood of the Curie 
temperature (358°C). The density is p. The dashed line is a linear extrapolation from high 
temperatures. (After P. Weiss and R. Forrer.) 
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From (4) and the definition (11.22) of the Curie constant C we may deter- 
mine the value of the mean field constant A in (1): 


(CCS) 


T c _ 3 k B T c 

c. Ng 2 S(S + l)/4 ' 


(5) 


For iron T ( ~ 1000 K, g =» 2, and S ~ 1; from (5) wc have A ~ 5000. With 
M s . « 1700 we have B E = AM =* (5000)(1700) * 10 5 * 7 C = 10 3 T. The exchange 
field in iron is very much stronger than the real magnetic field due to the other 
magnetic ions in the crystal: a magnetic ion produces a field ~ ju B /a 3 or about 
10 3 G = 0.1 T at a neighboring lattice point. 

The exchange field gives an approximate representation of the quantum- 
mechanical exchange interaction. On certain assumptions it is shown in texts 
on quantum theory that the energy of interaction of atoms i,j bearing electron 
spins .S’,, Sj contains a term 

U = -2/S; * , (6) 


where J is the exchange integral and is related to the overlap of the charge dis- 
tributions of the atoms i,j. Equation (6) is called the Heisenberg model. 

The charge distribution of a system of two spins depends on whether the 
spins are parallel or antiparallel 2 for the Pauli principle excludes two electrons 
of the same spin from being at the same place at the same time. It does not ex- 
clude two electrons of opposite spin. Thus the electrostatic energy of a system 
will depend on the relative orientation of the spins: the difference in energy 
defines the exchange energy. 

The exchange energy of two electrons may be written in the form ~2Js 1 ■ s 2 
as in (6), just as if there were a direct coupling between the directions of the 
two spins. For many purposes in ferromagnetism it is a good approximation to 
treat the spins as classical angular momentum vectors. 

We can establish an approximate connection between the exchange inte- 
gral / and the Curie temperature T c . Suppose that the atom under considera- 
tion has 3 nearest neighbors, each connected with the central atom by the 
interaction /. For more distant neighbors we take / as zero. The mean field 
theory result is 


j — 3k B T c 
J ~ 2 zS(S + 1) ' 


(7) 


Better statistical approximations give somewhat different results. For 
the sc, bcc, and fee structures with S = f, Rushbrooke and Wood give 


5 If two spins arc antiparallel, the wavefunctions of the two electrons must be symmetric, as in 

the combination «(r,)o(r 2 ) + u(r 2 )o(r 1 ). If the two spins arc parallel, the Pauli principle requires 
that the orbital part of the wavefunction be antisymmetric, as in «(r 1 )o(r 2 ) — u(r2)c(ri), for here if 
we interchange the coordinates r lt r 2 the wavefunction changes sign. If we set the positions equal 

so that r x = r s then the antisymmetric function vanishes: for parallel spins there is zero probability 
of finding the two electrons at the same position. 



326 


k B TJzJ = 0.28; 0.325; and 0.346, respectively, as compared with 0.500 from (7) 
for all three structures. If iron is represented by the Heisenberg model with 
S = 1, then the observed Curie temperature corresponds to J — 11.9 meV. 

Temperature Dependence of the Saturation Magnetization 

We can also use the mean field approximation below the Curie tempera- 
ture to find the magnetization as a function of temperature. We proceed as 
before, but instead of the Curie law we use the complete Brillouin expression 
for the magnetization. For spin | this is M = N/i tanh(p.B/k B T). 

If we omit the applied magnetic field and replace B by the molecular field 
B e — AM, then 

M — N/x tanh(pAM/k B T) . (8) 

We shall see that solutions of this equation with nonzero M exist in the tem- 
perature range between 0 and T c . 

To solve (8) we write it in terms of the reduced magnetization m = M/Np. 
and the reduced temperature t — k B T/Np 2 A , whence 

rn — tanh(m/f) . (9) 

We then plot the right and left sides of this equation separately as functions of 
m, as in Fig. 3. The intercept of the two curves gives the value of rn at the tem- 
perature of interest. The critical temperature is t — 1, or T c = Np 2 A/k B . 



Figure 3 Graphical solution of Eq. (9) for the reduced magnetization m as a function of tempera- 
ture. The reduced magnetization is defined as rn — M/Nfi. The left-hand side of Eq (9) is plotted 
as a straight line m with unit slope. The right-hand side is tanh(?n/t) and is plotted vs. m for three 
different values of the reduced temperature t = k B T/N(i 2 \ - T/T c . The three curves correspond to 
the temperatures 2 T c , T e , and 0.5 T c . The curve for t — 2 intersects the straight line m only at 
m = 0, as appropriate for the paramagnetic region (there is no external applied magnetic field). 
The curve for t = l (or T = T„) is tangent to the straight line m at the origin; this temperature 
marks the onset of ferromagnetism. The curve for t — 0.5 is in the ferromagnetic region and inter- 
sects the straight line m at about m = ■ ) 9i As f — > 0 the intercept moves up to m = 1, so that 
all magnetic moments arc lined up at absolute zero. 
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Figure 4 Saturation magnetization of nickel as a function of temperature, together with the 
theoretical curve for S = | on the mean field theory. Experimental values by P. Weiss and R. Forrer. 


The curves of M versus T obtained in this way reproduce roughly the fea- 
tures of the experimental results, as shown in Fig. 4 for nickel. As T increases, 
the magnetization decreases smoothly to zero at T — T c . This behavior 
classifies the usual ferromagnetic/paramagnetic transition as a second-order 
transition. 

The mean-field theory does not give a good description of the variation of 
M at low temperatures. For T < T c the argument of tanh in (9) is large, and 

tanh £ = 1 — 2e _2f . . . . 

To lowest order the magnetization deviation AM = M(0) — M(T) is 

AM = 2N/Jt exp(—2\Nfi 2 /k B T) . (10) 

The argument of the exponential is equal to —2 T../T. For T = 0. 1 T r we have 
A M/Np. = 4 X 10“ s . 

The experimental results show a much more rapid dependence of AM on 
temperature at low temperatures. At T = 0.1T C we have A M/M = 2 X 10 -3 
from the data of Fig. 5. The leading term in AM is observed from experiment 
to have the form 


AM/M(0) = AT 3/2 , (11) 

where the constant A has the experimental value (7.5 ± 0.2) X 10 6 deg 3/2 for 
Ni and (3.4 ± 0.2) X 10 -6 deg -3/2 for Fe. The result (11) finds a natural expla- 
nation in terms of spin wave theory. 
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Figure 5 Decrease in magnetization of nickel with temperature, after Argyle, Champ, and Pugh. 
In the plot AM = 0 at 4.2 k 


Saturation Magnetization at Absolute Zero 

Table 1 gives representative values of the saturation magnetization M s , the 
ferromagnetic Curie temperature, and the effective magneton number de- 
fined by M s ( 0) = n B Nix B , where N is the number of formula units per unit 
volume. Do not confuse n B with the paramagnetic effective magneton number 
p defined by (11.23). 


Table 1 Ferromagnetic crystals 


Substance 

Magnetization M„ 

in gauss 

n„(0 K). 

per formula unit 

Curie 

temperature, 

inK 

Room 

temperature 

0K 

Fe 

1707 

1740 

2.22 

1043 

Co 

1400 

1446 

1.72 

1388 

Ni 

485 

510 

0.606 

627 

Gd 

— 

2060 

7.63 

292 

Dy 

— 

2920 

10.2 

88 

MnAs 

670 

870 

3.4 

318 

MnBi 

620 

680 

3.52 

630 

MnSb 

710 

— 

3.5 

587 

CrO s 

515 

— 

2.03 

386 

Mn0Fe 2 0 3 

410 

— 

5.0 

573 

FeOFe 2 0 ; j 

480 

— 

4.1 

858 

NiOFe-O; 

270 

— 

2.4 

(858) 

CuOFc 2 0 3 

135 

— 

1.3 

728 

Mg0Fe 2 0 3 

no 

— 

1.1 

713 

EuO 

— 

1920 

6.8 

69 

Y.,Fe 5 0 12 

130 

200 

5.0 

560 
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Observed values of n B are often nonintegral. There are several possible 
causes. One is the spin-orbit interaction which adds or subtracts some orbital 
magnetic moment. Another cause in ferromagnetic metals is the conduction 
electron magnetization induced locally about a paramagnetic ion core. A third 
cause is suggested by the drawing in Fig. 1 of the spin arrangement in a ferri- 
magnet: if there is one atom of spin projection — S for every two atoms +S, the 
average spin is 5S. 

Are there in fact any simple ferromagnetic insulators, with all ionic spins 
parallel in the ground state? The few simple ferromagnets known at present 
include CrBr 3 , EuO, and EuS. 

A band or itinerant electron model accounts for the ferromagnetism of 
the transition metals Fe, Co, Ni. The approach is indicated in Figs. 6 and 7. 
The relationship of 4s and 3d bands is shown in Fig. 6 for copper, which is 
not ferromagnetic. If we remove one electron from copper, we obtain nickel 
which has the possibility of a hole in the 3d band. In the band structure 
of nickel shown in Fig. 7a for T > T c we have taken 2 X 0.27 = 0.54 of an 
electron away from the 3d band and 0.46 away from the 4s band, as compared 
with copper. 

The band structure of nickel at absolute zero is shown in Fig. 7b. Nickel is 
ferromagnetic, and at absolute zero n B = 0.60 Bohr magnetons per atom. After 
allowance for the magnetic moment contribution of orbital electronic motion, 
nickel has an excess of 0.54 electron per atom having spin preferentially ori- 
ented in one direction. The exchange enhancement of the susceptibility of 
metals was the subject of Problem 11.6. 



7.1 eVJ 
1_ 2 


4s 


— F ermi surface — 

~|2.2 eV 




\~J 

3.46 eV 

t 

\ 


3d 


3d t 

3di 

Filled — 10 electrons 


5 electrons 

5 electrons 




(b) 


Figure 6a Schematic relationship of 4s and 3 d bands in metallic copper. The 3 d band holds 10 
electrons per atom and is filled. The 4s band can hold two electrons per atom; it is shown half- 
filled, as copper has one valence electron outside the filled 3d shell. 

Figure 6b The filled 3d band of copper shown as two separate sub-bands of opposite electron 
spin orientation, each band holding five electrons. With both sub-bands filled as shown, the net 
spin (and hence the net magnetization) of the d band is zero. 
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Figure 7a Band relationships in nickel above the Curie temperature. The net magnetic moment 
is zero, as there are equal numbers of holes in the 3 d l and 3 d T bands. 

Figure 7b Schematic relationship of bands in nickel at absolute zero. The energies of the 3 d T 
and 3 d i sub-bands are separated by an exchange interaction. The 3 d T band is filled; the 3 d 4/ 
band contains 4.46 electrons and 0.54 hole. The 4s band is usually thought to contain approxi- 
mately equal numbers of electrons in both spin directions, and so we have not troubled to divide it 
into sub-bands. The net magnetic moment of 0.54 per atom arises from the excess population 
of the 3d t band over the 3d 4- band. It is often convenient to speak of the magnetization as arising 
from the 0.54 hole in the 3d ! band. 


MAGNONS 

A magnon is a quantized spin wave. We use a classical argument, just as we 
did for phonons, to find the magnon dispersion relation for co versus k. We 
then quantize the magnon energy and interpret the quantization in terms of 
spin reversal. 

The ground state of a simple ferromagnet has all spins parallel, as in Fig. 8a. 
Consider N spins each of magnitude S on a line or a ring, with nearest-neighbor 
spins coupled by the Heisenberg interaction: 

l/=-2/EW> • (12) 

P=1 



-Hah*- -Hah*- -H <2 H- 


(a) (b) (c) 

Figure 8 (a) Classical picture of the ground state of a simple ferromagnet; all spins are parallel, 
(b) A possible excitation; one spin is reversed, (c) The low-lying elementary excitations are spin 
waves. The ends of the spin vectors precess on the surfaces of cones, with successive spins ad- 
vanced in phase by a constant angle. 




12 Ferromagnetism and Antiferromagnetism 


331 



Figure 9 A spin wave on a line of spins, (a) The spins viewed in perspective, (b) Spins viewed 
from above, showing one wavelength. The wave is drawn through the ends of the spin vectors. 


Here J is the exchange integral and fiS p is the angular momentum of the 
spin at site p. If we treat the spins S p as classical vectors, then in the ground 
state S p • S p+1 = S 2 and the exchange energy of the system is U 0 = —2NJS 2 . 

What is the energy of the first excited state? Consider an excited state with 
one particular spin reversed, as in Fig. 8b. We see from (12) that this increases 
the energy by 8/S 2 , so that f /, — U, } + 8/S 2 . But we can form an excitation of 
much lower energy if we let all the spins share the reversal, as in Fig. 8c. The 
elementary excitations of a spin system have a wavelike form and are called 
magnons (Fig. 9). These are analogous to lattice vibrations or phonons. Spin 
waves are oscillations in the relative orientations of spins on a lattice; lattice vi- 
brations are oscillations in the relative positions of atoms on a lattice. 

We now give a classical derivation of the magnon dispersion relation. The 
terms in (12) which involve the pth spin are 

-2/Sp-(Sp_ 1 + Sp +1 ) . (13) 

We write magnetic moment at site p as ju ;l = ~gpt B S p . Then (13) becomes 

-/V[(-2// g^xv^ Vi)] . (14) 

which is of the form — fi p ■ B p , where the effective magnetic field or exchange 
field that acts on the pth spin is 

B p = (-2//g A t B )(S p _ 1 + Sp +1 ) . (15) 

From mechanics the rate of change of the angular momentum hS p is equal 
to the torque fi p X B p which acts on the spin: h dS p /dt = fi p X B ; „ or 

dS p /dt = (-g pt B /h) Sp X Bp = (2//ft)(S p X Sp_! + S p X Sp +1 ) . (16) 

In Cartesian components 

ds;/dt = (2j/h)[sy(s;_ 1 + s* +1 ) - + s» +1 )] , (17) 

and similarly for dS^/d.t and dS~,/dt. These equations involve products of spin 
components and are nonlinear. 
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If the amplitude of the excitation is small (if S x p , S-J < S), we may obtain an 
approximate set of linear equations by taking all S* = S and by neglecting 
terms in the product of S * and S'' which appear in the equation for dS z /dt. The 
linearized equations are 

ds;/dt = (2JS/h)(2Sy - S^_! - S* +1 ) ; (18a) 

dsy/dt = -(2jsm(2s; - s;^ - s; +J ) : (isb) 

dSp/dt — 0 . (19) 

By analogy with phonon problems we look for traveling wave solutions of 
(18) of the form 

S* = u exp[i(pka - cot)] ; S^, = v exp\i(pka ~ cot)] , (20) 

where u, v are constants, p is an integer, and a is the lattice constant. On sub- 
stitution into (18) we have 

—iwu — (2JS/h)(2 — e~ ,ku — e lka ) v — (4JS/h)(l — cos ka)v ; 

— i(ov = —(2JS/h)(2 — e~ ,ka — e ika )u = — (4JS/ft)(l — cos ka)u . 

These equations have a solution for u and o if the determinant of the coef- 
ficients is equal to zero: 

ico (4JS/h)( 1 — cos ka) 

— {AJS/h)(\ — cos ka) uo 

whence 

ha) = 4JS(1 — cos ka) . (22) 

This result is plotted in Fig. 10. With this solution we find that v = ~iu, corre- 
sponding to circular precession of each spin about the z axis. We see this on 
taking real parts of (20), with v set equal to ~iu. Then 

S*=u cos (pka — cat) ; S." = u sin(p/ca — cot) . 

Equation (22) is the dispersion relation for spin waves in one dimension 
with nearest-neighbor interactions. Precisely the same result is obtained from 
the quantum-mechanical solution; see QTS, Chapter 4. At long wavelengths 
ka <8 1, so that (1 — cos ka) = i(ka) 2 and 

hw = (2]Sa 2 )k 2 . (23) 

The frequency is proportional to k 2 ; in the same limit the frequency of a 
phonon is directly proportional to k. 
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neighbor interactions. 


The dispersion relation for a ferromagnetic cubic lattice with nearest- 
neighbor interactions 

hut — 2JS[z — ^ cos(k • S)] , (24) 

where the summation is over the z vectors denoted by 8 which join the central 
atom to its nearest neighbors. For ka <s 1, 

hw = (2 \JSa 2 )k 2 (25) 

for all three cubic lattices, where a is the lattice constant. 

The coefficient of k 2 often may be determined accurately by neutron scat- 
tering or by spin wave resonance in thin films. Chapter 13. By neutron scatter- 
ing G. Shirane and coworkers find, in the equation fiat = Dk 2 , the values 281, 
500, and 364 meV A 2 for D at 295 K in Fe, Co, and Ni, respectively. 

Quantization of Spin Waves. The quantization of spin waves proceeds as for 
photons and phonons. The energy of a mode of frequency a> k with n k magnons 
is given by 

= (nk + ■ (26) 

The excitation of a magnon corresponds to the reversal of one spin |. 
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Thermal Excitation of Magnons 

In thermal equilibrium the average value of the number of magnons ex- 
cited in the mode k is given by the Planck distribution 3 

M = — ji — ]i7t\ — T ‘ < 27) 

exp (ncoi/kgl ) — 1 

The total number of magnons excited at a temperature T is 

X n k = [ dco D((o)(n(at)) , (28) 

k J 

where D(a>) is the number of magnon modes per unit frequency range. The 
integral is taken over the allowed range of k, which is the first Brillouin zone. 
At sufficiently low temperatures we may evaluate the integral between 0 and 
00 because (n(w)) — > 0 exponentially as a> — > . 

Magnons have a single polarization for each value of k. In three dimen- 
sions the number of modes of wavevector less than k is (l/2ir) 3 (47rk 3 /3) per 
unit volume, whence the number of magnons D(w)da) with frequency 7 in dco at 
u) is (l/27r) 3 (4 Trk 2 )(dk/dco) d(o. In the approximation (25), 

a — \ir } " ■ 

Thus the density 7 of modes for magnons is 

DM = , (29) 

4ir 2 \2JSa 2 / 

so that the total number of magnons is, from (28), 


2 




i f k B r \ 3/2 f M ^ 
477 s \2JSa 2 ) Jo V-l 


The definite integral is found in tables and has the value (0.0587x4m 2 ). 

The number N of atoms per unit volume is Q/ad, where Q — 1, 2, 4 for sc, 
bcc, fee lattices, respectively 7 Now (f-n^/NS is equal to the fractional change 
of magnetization AM/M(0), whence 


AM _ 0.0587 | 

(k B T\ 

3/2 

1 • 

M( 0) SO 1 

[2JSJ 


(30) 


3 The argument is exactly as for phonons or photons. The Planck distribution follows for any 
problem where the energy levels are identical with those of a harmonic oscillator or collection of 
harmonic oscillators. 
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This result is the Bloch T 3 ' 2 law and has been confirmed experimentally. In 
neutron scattering experiments spin waves have been observed up to tempera- 
tures near the Curie temperature and even above the Curie temperature. 


NEUTRON MAGNETIC SCATTERING 

An x-ray photon sees the spatial distribution of electronic charge, whether 
the charge density is magnetized or unmagnetized. A neutron sees two aspects 
of a crystal: the distribution of nuclei and the distribution of electronic magne- 
tization. The neutron diffraction pattern for iron is shown in Fig. 11. 

The magnetic moment of the neutron interacts with the magnetic moment 
of the electron. The cross section for the neutron-electron interaction is of the 
same order of magnitude as for the neutron-nuclear interaction. Diffraction of 
neutrons by a magnetic crystal allows the determination of the distribution, 
direction, and order of the magnetic moments. 

A neutron can be inelastically scattered by the magnetic structure, with 
the creation or annihilation of a magnon (Fig. 12); such events make possible 
the experimental determination of magnon spectra. If the incident neutron 
has wavevector k,, and is scattered to k) with the creation of a magnon of 
wavevector k, then by conservation of crystal momentum k r , = k) + k + G, 
where G is a reciprocal lattice vector. By conservation of energy 


2 M n 


hX 2 

2M„ 


+ hw k , 


(31) 
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Figure 12 Scattering of a neutron by an ordered 
magnetic structure, with creation of a magnon. 




Figure 13 Magnon energy as a function of the square of the wavevector, for the ferromagnet 
MnPtj. (After B. Antonini and V. J. Minldewicz.) 


where ha > k is the energy of the magnon created in the process. The observed 
magnon spectrum for MnPt 3 is shown in Fig. 13. 


FERRIMAGNETIC ORDER 

In many ferromagnetic crystals the saturation magnetization at T = 0 K 
does not correspond to parallel alignment of the magnetic moments of the 
constituent paramagnetic ions, even in crystals for which the individual para- 
magnetic ions have their normal magnetic moments. 

The most familiar example is magnetite, Fe 3 0 4 or FeO • Fe 2 0 3 . From 
Table 11.2 we see that ferric (Fe 3+ ) ions are in a state with spin S = | and 
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8Fe 3+ 




Figure 14 Spin arrangements in magnetite, FeO • Fe 2 0 3 , showing how the moments of the Fe 3+ 
ions cancel out, leaving only the moments of the Fe 2+ ions. 


zero orbital moment. Thus each ion should contribute 5 n B to the saturation 
moment. The ferrous (Fe 2+ ) ions have a spin of 2 and should contribute 4 fi B , 
apart from any residual orbital moment contribution. Thus the effective num- 
ber of Bohr magnetons per Fe 3 0 4 formula unit should be about 2 X 5 + 4 = 14 
if all spins were parallel. 

The observed value (Table 1) is 4.1. The discrepancy is accounted for if 
the moments of the Fe 3+ ions are antiparallel to each other: then the observed 
moment arises only from the Fe z+ ion, as in Fig. 14. Neutron diffraction 
results agree with this model. 

A systematic discussion of the consequences of this type of spin order was 
given by L. Neel with reference to an important class of magnetic oxides 
known as ferrites. The usual chemical formula of a ferrite is MO • Fe 2 0 3 , 
where M is a divalent cation, often Zn, Cd, Fe, Ni, Cu, Co, or Mg. The term 
ferrimagnetic was coined originally to describe the ferrite-type ferromag- 
netic spin order such as Fig. 14, and by extension the term covers almost any 
compound in which some ions have a moment antiparallel to other ions. Many 
ferri magnets are poor conductors of electricity, a quality exploited in applica- 
tions such as rf transformer cores. 

The cubic ferrites have the spinel crystal structure shown in Fig. 15. 
There are eight occupied tetrahedral (or A) sites and 16 occupied octahedral 
(or B) sites in a unit cube. The lattice constant is about 8 A. A remarkable fea- 
ture of the spinels is that all exchange integrals J AA , J AB , and J BB are negative 
and favor antiparallel alignment of the spins connected by the interaction. But 
the AB interaction is the strongest, so that the A spins are parallel to each 
other and the B spins are parallel to each other, just in order that the A spins 
may be antiparallel to the B spins. If J in U - —2JS, ■ S,. is positive, we say that 
the exchange integral is ferromagnetic; if J is negative, the exchange integral is 
antiferromagnetic. 
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Figure 15 Crystal structure of the mineral spinel MgAl 2 0 4 ; the Mg 2 * ions occupy tetrahedral 
sites, each surrounded by four oxygen ions; the Al 3+ occupy octahedral sites, each surrounded by 
six oxygen ions. This is a normal spinel arrangement; the divalent metal ions occupy the tetrahe- 
dral sites. In the inverse spinel arrangment the tetrahedral sites are occupied by trivalent metal 
ions, while the octahedral sites are occupied half by divalent and half by trivalent metal ions. 


We now prove that three antiferromagnetic interactions can result in ferri- 
magnetism. The mean exchange fields acting on the A and B spin lattices may 
be written 


B a = ~AM a - jaMj ; B s = ~pM A - vM B ; (32) 

taking all mean field constants A, p, v to be positive. The minus sign then cor- 
responds to an antiparallel interaction. The interaction energy density is 

U = -f(B A -M A + B fi -Mg) = |A + pM A -M B + I vM\ ; (33) 

this is lower when M A is antiparallel to M s than when M A is parallel to M g . 
The energy when antiparallel should be compared with zero, because a possi- 
ble solution is M A = M B = 0. Thus when 

pM A M B > |(AM a + vM\) , (34) 

the ground state will have M A directed oppositely to M B . (Under certain condi- 
tions there maybe noncollinear spin arrays of still lower energy.) 

Curie Temperature and Susceptibility of Ferrimagnets 

We define separate Curie constants C A and C B for the ions on the A and B 
sites. For simplicity, let all interactions be zero except for an antiparallel interac- 
tion between the A and B sites: B A = — ^iM b ; B g = — /U.M A , where p is positive. 
The same constant p is involved in both expressions because of the form of (33). 
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We have in the mean field approximation 


(CGS) M a T = C A (B a - jl lM b ) ; M B T = C B (B a - p M A ) , (35) 

where B a is the applied field. These equations have a nonzero solution for M A 
and M fi in zero applied field if 


T 

liC B 



(36) 


so that the ferrimagnetic Curie temperature is given by T c — /x(C A C B ) l/2 . 
W’e solve (35) for M A and M g to obtain the susceptibility at T > T c : 


(CGS) 


M a + M B _ (C A + C b )T 2 fiC, A C B 

B a T 2 — T 2 c 


(37) 


a result more complicated than (4). Experimental values for Fe 3 0 4 are plotted 
in Fig. 16. The curvature of the plot of \/\ versus T is a characteristic feature 
of a ferrimagnet. We consider below the antiferromagnetic limit C A — C B . 


Iron Garnets. The iron garnets are cubic ferrimagnetic insulators with the 
general formula M :J Fe 5 0 12 , where M is a trivalent metal ion and the Fe is the 
trivalent ferric ion (S = 5 , L = 0). An example is yttrium iron garnet Y 3 Fe 50 12 , 
known as YIG. Here Y 3+ is diamagnetic. 

The net magnetization of YIG is due to the resultant of two oppositely 
magnetized lattices of Fe 3+ ions. At absolute zero each ferric ion contributes 
±5 fi n to the magnetization, but in each formula unit the three Fe 3+ ions 
on sites denoted as d sites are magnetized in one sense and the two Fe 3+ ions 
on a sites are magnetized in the opposite sense, giving a resultant of 5/jl b per 
formula unit in good agreement with the measurements of Geller et al. 
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The mean field at an a site due to the ions on the d sites is B a = 
— (1.5 X 10 4 )M t; . The observed Curie temperature 559 K of YIG is due to the 
a-d interaction. The only magnetic ions in YIG are the ferric ions. Because 
these are in an L = 0 state with a spherical charge distribution, their interac- 
tion with lattice deformations and phonons is weak. As a result YIG is charac- 
terized by very narrow linewidths in ferromagnetic resonance experiments. 

ANTIFERROMAGNETIC ORDER 

A classical example of magnetic structure determination by neutrons is 
shown in Fig. 17 for MnO, which has the NaCl structure. At 80 K there are 
extra neutron reflections not present at 293 K. The reflections at 80 K may be 
classified in terms of a cubic unit cell of lattice constant 8.85 A. At 293 K the 
reflections correspond to an fee unit cell of lattice constant 4.43 A. 

But the lattice constant determined by x-ray reflection is 4.43 A at both 
temperatures, 80 K and 293 K. We conclude that the chemical unit cell has the 
4.43 A lattice parameter, but that at 80 K the electronic magnetic moments of 



Figure 17 Neutron diffraction patterns for MnO below and above the spin-ordering temperature 
of 120 K, after C, G. Shull, W. A. Strauser, and E. O. Wollan. The reflection indices arc based on 
an 8.85 A cell at 80 K and on a 4.43 A cell at 293 K. At the higher temperature the Mn 21 ions are 
still magnetic, but they are no longer ordered. 
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Figure 18 Ordered arrangements of spins of the Mn 2+ ions in manganese oxide, MnO, as 
determined by neutron diffraction. The 0 2+ ions are not shown. 



Figure 19 Spin ordering in ferromagnets (/ > 0) and antiferromagnets (/ < 0). 

the Mn 2+ ions are ordered in some nonferromagnetic arrangement. If the 
ordering were ferromagnetic, the chemical and magnetic cells would give the 
same reflections. 

The spin arrangement shown in Fig. 18 is consistent with the neutron dif- 
fraction results and with magnetic measurements. The spins in a single [111] 
plane are parallel, but spins in adjacent [111] planes are antiparallel. Thus 
MnO is an antiferromagnet, as in Fig. 19. 

In an antiferromagnet the spins are ordered in an antiparallel 
arrangement with zero net moment at temperatures below the ordering or 
Neel temperature (Table 2). The susceptibility of an antiferromagnet is not 
infinite at T = T N , but has a weak cusp, as in Fig. 20. 

An antiferromagnet is a special case of a ferrimagnet for which both sub- 
lattices A and B have equal saturation magnetizations. Thus C A = C B in (37), 
and the Neel temperature in the mean field approximation is given by 


T n — fj.C , 


(38) 
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Table 2 Antiferromagnetic crystals 


Substance 

Paramagnetic 
ion lattice 

Transition 

temperature, 

T n , in K 

Curie-Weiss 

9, in K 

9 

t n 

*(0) 

x(t n ) 

MnO 

fee 

116 

610 

5.3 

2 

3 

MnS 

fee 

160 

528 

3.3 

0.82 

MnTe 

hex. layer 

307 

690 

2.25 


MnF 2 

be tetr. 

67 

82 

1.24 

0.76 

FeF 2 

be tetr. 

79 

117 

1.48 

0.72 

FeCl 2 

hex. layer 

24 

48 

2.0 

<0.2 

FeO 

fee 

198 

570 

2.9 

0.8 

CoCl 2 

hex. layer 

25 

38.1 

1.53 


CoO 

fee 

291 

330 

1.14 


NiCl 2 

hex. layer 

50 

68.2 

1.37 


NiO 

fee 

525 

-2000 

-4 


Cr 

bee 

308 





Paramagnetism Ferromagnetism Antiferromagnetism 
Susceptibility XX X 


-9 0 T 

X=-C- 

, * T + e 

Curie law Curie-Weiss law (T > T N ) 

(T > T c ) 

Figure 20 Temperature dependence of the magnetic susceptibility in paramagnets, ferromag- 
nets, and antiferromagnets. Below the Neel temperature of an antiferromagnet the spins have an- 
tiparallel orientations; the susceptibility attains its maximum value at T,, where there is a well- 
defined kink in the curve of \ versus T. The transition is also marked by peaks in the heat capacity 
and the thermal expansion coefficient. 




where C refers to a single sublattice. The susceptibility in the paramagnetic 
region T > T N is obtained from (37): 


_ 2CT - 2/xC 2 _ 2C _ 2 C 

x T 2 -( M C) 2 T + ^c t + t, 

The experimental results at T > T N are of the form 


(39) 


(CGS) 


2C 

x T+e ' 


(40) 
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Experimental values of 6/T N listed in Tabic 2 often differ substantially from 
the value unity expected from (39). Values of 9/T N of the observed magnitude 
may be obtained when next-nearest-neighbor interactions are provided for, 
and when possible sublatticc arrangements are considered. If a mean field 
constant — e is introduced to describe interactions within a sublattice, then 
9/T n = (p + e)/(p - e). 

Susceptibility Below the Neel Temperature 

There are two situations: with the applied magnetic field perpendicular to 
the axis of the spins; and with the field parallel to the axis of the spins. At and 
above the Neel temperature the susceptibility is nearly independent of the di- 
rection of the field relative to the spin axis. 

For B , perpendicular to the axis of the spins we can calculate the suscepti- 
bility by elementary considerations. The energy density in the presence of the 
field is, with M — \M a \ = \M B \, 

U = pM A -M s - B„-(M a + M b ) = -pM 2 [ 1 - |(2 <p) 2 ] - 2 B a M<p , (41) 

where 2ip is the angle the spins make with each other (Fig. 21a). The energy is 

a minimum when 

dU/dcp = 0 = 4/jlM 2 <p — 2 B a M ; <p = B jlpM , (42) 

so that 

(CGS) = 2M<p/B B - l/p . (43) 

In the parallel orientation (Fig. 21b) the magnetic energy is not changed if 
the spin systems A and B make equal angles with the field. Thus the suscepti- 
bility at T = 0 K is zero: 

*|(0) = 0. (44) 





(b) 

Figure 21 Calculation of (a) perpendicular and (b) parallel susceptibilities at 0 K, in the mean 
field approximation. 
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Figure 22 Magnetic susceptibility of manganese fluoride, MnF 2 , parallel and perpendicular to 
the tetragonal axis. (After S. Foner.) 


The parallel susceptibility increases smoothly with temperature up to T v . 
Measurements on MnF 2 are shown in Fig. 22. In very strong fields the spin 
systems will turn discontinuously from the parallel orientation to the perpen- 
dicular orientation where the energy is lower. 

Antiferromagnetic Magnons 

We obtain the dispersion relation of magnons in a one-dimensional anti- 
ferromagnet by making the appropriate substitutions in the treatment 
(16)-(22) of the ferromagnetic line. Let spins with even indices 2 p compose 
sublattice A, that with spins up (S z = S); and let spins with odd indices 2p + 1 
compose sublattice B, that with spins down (S~ = ~S). 

We consider only nearest-neighbor interactions, with J negative. Then 


(18) written for A becomes, with a careful look at (17), 

dsydt = (2JS/h)(-2S% - S|,_ T - S|p + i) ; (45a) 

dS%/dt = —(2]S/fi)(—2S\ p - S* 2p _ x - S* 2p+J ) . (45b) 

The corresponding equations for a spin on B are 

dS\ v+ M = (2JS/h)(2b\ +1 + + Si- +2 ) ; (46a) 

dSl p+1 /dt = —(2JS/h)(2S x 2p+l + Sl p + Si p1 2 ) . (46b) 

We form S + = S x + iS then 

dSt p klt = {2iJS/h){2St p + SJ_ L + S£ +1 ) ; (47) 

dS+, +1 /dt = - (2iJS/h)(2Sl p+1 + S 2 + p + S£ +2 ) . (48) 
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Figure 23 Magnon dispersion relation in the simple cubic antiferromagnet RbMnF 3 as deter- 
mined at 4.2 K by inelastic neutron scattering. (After C. G. Windsor and R. W. H. Stevenson.) 


We look for solutions of the form 

S^p — u exp[i2pka — iwt\ ; S^, +1 = v exp[i(2p 1 1) ka — iwt] , 

so that (47) and (48) become, with o> e![ = — 4 JS/h — 4\j \S/fi. 

om = | o > ex ( 2 u + ve~ ,ka + ve' ka ) ; 

—ojv = | co ex (2v + tie ,ka + ue ika ) . 

Equations (50) have a solution if 

w ex~ w ex coska _ Q 

w ex cos ka o) CI + a> 

thus « 2 = - cos 2 ka) ; &> = w ex | sin ka | . (52) 

The dispersion relation for magnons in an antiferromagnet is quite differ- 
ent from (22) for magnons in a ferromagnet. For ka 1 we see that (52) is lin- 
ear in k: a> = <a cx \ka\. The magnon spectrum of RhMnF 3 is shown in Fig. 23, as 
determined by inelastic neutron scattering experiments. There is a large re- 
gion in which the magnon frequency is linear in the wavevector. 

Well-resolved magnons have been observed in MnF 2 at specimen temper- 
atures up to 0.93 of the N£el temperature. Thus even at high temperatures the 
magnon approximation is useful. 


(49) 


(50a) 

(50b) 


(51) 
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FERROMAGNETIC DOMAINS 

At temperatures well below the Curie point the electronic magnetic mo- 
ments of a ferromagnet are essentially parallel when regarded on a micro- 
scopic scale. Yet, looking at a specimen as a whole, the magnetic moment may 
be very much less than the saturation moment, and the application of an exter- 
nal magnetic field may be required to saturate the specimen. The behavior ob- 
served in polycrystalline specimens is similar to that in single crystals. 

Actual specimens are composed of small regions called domains, within 
each of which the local magnetization is saturated. The directions of magneti- 
zation of different domains need not be parallel. An arrangement of domains 
with approximately zero resultant magnetic moment is shown in Fig. 24. Do- 
mains form also in antiferromagnetics, ferroelectrics, antiferroelectrics, ferro- 
elastics, superconductors, and sometimes in metals under conditions of a 
strong de Haas-van Alphen effect. The increase in the gross magnetic moment 
of a ferromagnetic specimen in an applied magnetic field takes place by two 
independent processes: 

• In weak applied fields the volume of domains (Fig. 25) favorably oriented 
with respect to the field increases at the expense of unfavorably oriented 
domains; 



Figure 24 Ferromagnetic domain pattern on a single crystal platelet of nickel. The domain 
boundaries are made visible by the Bitter magnetic powder pattern technique. The direction of 
magnetization within a domain is determined by observing growth or contraction of the domain in 
a magnetic field. (After R. W. De Blois.) 
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Applied field — *■ 

Figure 25 Representative magnetization curve, showing the dominant magnetization processes 
in the different regions of the curve. 



Figure 26 The technical magnetization curve (or hysteresis loop). The coercivity H c is the re- 
verse field that reduces B to zero; a related coercivity H d reduces M or B — H to zero. The 
remanence II r is the value of II at H = 0. The saturation induction B, is the limit of B — H at large 
H, and the saturation magnetization M s — BJArr. In SI the vertical axis is B — fi 0 (H + M). 


• In strong applied fields the domain magnetization rotates toward the 
direction of the field. 

Technical terms defined by the hysteresis loop are shown in Fig. 26. The 
coercivity is usually defined as the reverse field H c that reduces the induction 
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Figure 27 Magnetization curves for single crystals of iron, nickel, and cobalt. From the curves 
for iron we see that the [100] directions are easy directions of magnetization and the [111] direc- 
tions are hard directions. The applied field is B„. (After Honda and Kaya.) 




(b) 


Figure 28 Asymmetry of the overlap of electron distributions on neighboring ions provides one 
mechanism of magnetociystalline anisotropy. Because of spin-orbit interaction the charge distrib- 
ution is spheroidal and not spherical. The asymmetry is tied to the direction of the spin, so that a 
rotation of the spin directions relative to the crystal axes changes the exchange energy and also 
changes the electrostatic interaction energy of the charge distributions on pairs of atoms. Both ef- 
fects give rise to an anisotropy energy. The energy of (a) is not the same as the energy of (b). 


B to zero, starting from saturation. In high coercivity materials the coercivity 
H ci is defined as the reverse field that reduces the magnetization M to zero. 

Anisotropy Energy 

There is an energy in a ferromagnetic crystal which directs the magnetization 
along certain crystallographic axes called directions of easy magnetization. This 
energy is called the magnetocrystalline or anisotropy energy. It does not 
come about from the pure isotropic exchange interaction considered thus far. 

Cobalt is a hexagonal crystal. The hexagonal axis is the direction of easy 
magnetization at room temperature, as shown in Fig. 27. One origin of the 
anisotropy energy is illustrated by Fig. 28. The magnetization of the crystal sees 
the crystal lattice through orbital overlap of the electrons: the spin interacts 
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with the orbital motion by means of the spin-orbit coupling. In cobalt the 
anisotropy energy density' is given by 

U K — K[ sin 2 0 + K 2 sin 4 6 , (53) 

where 0 is the angle the magnetization makes with the hexagonal axis. At room 
temperature K{ = 4.1 X 10 6 erg/cm 3 ; = 1.0 X 10 B erg/cm 3 . 

Iron is a cubic crystal, and the cube edges are the directions of easy mag- 
netization. To represent the anisotropy energy of iron magnetized in an arbi- 
trary' direction with direction cosines ay, a 2 , a 3 referred to the cube edges, we 
are guided by cubic symmetry. The expression for the anisotropy energy must 
be an even power of each a,, provided opposite ends of a crystal axis are equiv- 
alent magnetically, and it must be invariant under interchanges of the 
among themselves. The lowest order combination satisfying the sy-mmetry re- 
quirements is af + a| + a|, but this is identically equal to unity and does not 
describe anisotropy effects. The next combination is of the fourth degree: 
a\a\ + afa 3 + a|a|, and then of the sixth degree: alaia 2 . Thus 

U K = Kjia'jatl + afaf 4- afaf) + K 2 afa|af . (54) 

At room temperature in iron = 4.2 X 10 5 erg/cm 3 and K 2 = 1.5 X 10 s erg/cm 3 . 

Transition Region Between Domains 

A Bloch wall in a crystal is the transition layer that separates adjacent 
regions (domains) magnetized in different directions. The entire change in 
spin direction between domains does not occur in one discontinuous jump 
across a single atomic plane, but takes place in a gradual way over many atomic 
planes (Fig. 29). The exchange energy- is lower when the change is distributed 
over many spins. This behavior may be understood by interpreting the Heisen- 
berg equation (6) classically. We replace cos <p by 1 — ftp 2 ; then tv c . x = ]S z tp 2 is 
the exchange energy between two spins making a small angle tp with each 
other. Here ] is the exchange integral and S is the spin quantum number; w ex is 
referred to the energy' for parallel spins. 

If a total change of 77 occurs in N equal steps, the angle between neighbor- 
ing spins is 7 r/N, and the exchange energy’ per pair of neighboring atoms is 
w ex = /.S 2 (7r/A 7 ) 2 . The total exchange energy of a line of N + 1 atoms is 

Nw ex = JS 2 7j 2 /N . (55) 

The wall would thicken without limit were it not for the anisotropy energy, 
which acts to limit the width of the transition layer. The spins contained within 
the wall are largely directed away from the axes of easy magnetization, so there 
is an anisotropy energy associated with the wall, roughly proportional to the 
wall thickness. 
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Figure 29 The structure of the Bloch wall separating domains. In iron the thickness of the transi- 
tion region is about 300 lattice constants. 


Consider a wall parallel to the cube face of a simple cubic lattice and sepa- 
rating domains magnetized in opposite directions. We wish to determine the 
number N of atomic planes contained within the wall. The energy per unit area 
of wall is the sum of contributions from exchange and anisotropy energies: 
0\v iTgj. T <r anis . 

The exchange energy is given approximately by (55) for each line of atoms 
normal to the plane of the wall. There are 1/a 2 such lines per unit area, where 
a is the lattice constant. Thus cr ex = ir 2 JS 2 /Na 2 per unit area of wall. 

The anisotropy energy is of the order of the anisotropy constant times the 
thickness Na, or <r anis ~ KNa- therefore 


<t w ~ ( 7 rfS^Na 2 ) + KNa . 

(56) 

This is a minimum with respect to N when 


daJdN = 0 = —( / n 2 JS 2 /N 2 a 2 ) + Ka ■ 

(57) 

or 


N = (v 2 JS 2 /Ka 3 ) m . 

(58) 


For order of magnitude, N ~ 300 in iron. 

The total wall energy per unit area on our model is 

o- w = 2Tr(KJS 2 /a) m ; 


(59) 
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Figure 30 The origin of domains. 


in iron <x w ~ 1 erg/cm 2 . Accurate calculation for a 180° wall in a (100) plane 
gives cr w = 2(2K l JS 2 /a) m . 

Origin of Domains 

Landau and Lifshitz showed that domain structure is a natural conse- 
quence of the various contributions to the energy — exchange, anisotropy, and 
magnetic — of a ferromagnetic body. 

Direct evidence of domain structure is furnished by photomicrographs of 
domain boundaries obtained by the technique of magnetic powder patterns 
and by optical studies using Faraday rotation. The powder pattern method 
developed by F. Bitter consists in placing a drop of a colloidal suspension of 
finely divided ferromagnetic material, such as magnetite, on the surface of the 
ferromagnetic crystal. The colloid particles in the suspension concentrate 
strongly about the boundaries between domains where strong local magnetic 
fields exist which attract the magnetic particles. The discovery of transparent 
ferromagnetic compounds has encouraged the use also of optical rotation for 
domain studies. 

We may understand the origin of domains by considering the structures 
shown in Fig. 30, each representing a cross section through a ferromagnetic 
single crystal. In (a) we have a single domain; as a consequence of the mag- 
netic “poles” formed on the surfaces of the crystal this configuration will have 
a high value of the magnetic energy (l/8-7r) / B 2 dV. The magnetic energy den- 
sity for the configuration shown will be of the order of M 2 — 10 6 erg/cm 3 ; here 
M s denotes the saturation magnetization, and the units are CGS. 

In (b) the magnetic energy is reduced by roughly one-half by dividing the 
crystal into two domains magnetized in opposite directions. In (c) with N do- 
mains the magnetic energy is reduced to approximately 1/N of the magnetic 
energy of (a), because of the reduced spatial extension of the field. 





Figure 31 Domain of closure at the end of a single crystal iron whisker. The face is a (100) plane; 
the whisker axis is [001]. (Courtesy of R. V. Coleman, C. G. Scott, and A. Isin.) 


In domain arrangements such as (d) and (e) the magnetic energy is zero. 
Here the boundaries of the triangular prism domains near the end faces of the 
crystal make equal angles (45°) with the magnetization in the rectangular do- 
mains and with the magnetization in the domains of closure. The component 
of magnetization normal to the boundary is continuous across the boundary 
and there is no magnetic field associated with the magnetization. The flux cir- 
cuit is completed within the crystal — thus giving rise to the term domains of 
closure for surface domains that complete the flux circuit, as in Fig. 31. 

Domain structures are often more complicated than our simple examples, 
but domain structure always has its origin in the possibility of lowering the 
energy of a system by goingfrom a saturated configuration with high magnetic 
energy to a domain configuration with a lower energy. 

Coercivity and Hysteresis 

The coercivity is the magnetic field H c required to reduce the magnetiza- 
tion or the induction B to zero (Fig. 26). The value of the coercivity ranges 
over seven orders of magnitude; it is the most sensitive property of ferromag- 
netic materials which is subject to control. The coercivity may vary from 600 G 
in a loudspeaker permanent magnet (Alnico V) and 10,000 G in a special high 
stability magnet (SmCo 5 ) to 0.5 G in a commercial power transformer (Fe-Si 
4 wt. pet.) and 0.002 G in a pulse transformer (Supermalloy). Low coercivity is 
desired in a transformer, for this means low hysteresis loss per cycle of opera- 
tion. Materials with low coercivity are called soft; those with high coercivity 
are called hard, although there is not necessarily a 1 : 1 relationship of mag- 
netic hardness with mechanical hardness. 

The coercivity decreases as the impurity content decreases and also as in- 
ternal strains are removed by annealing (slow cooling). Amorphous ferromag- 
netic alloys may have low coercivity, low hysteresis losses, and high permeabil- 
ity. Alloys that contain a precipitated phase may have a high coercivity, as in 
Alnico V (Fig. 32). 
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Figure 32 Microstructure of Alnico V in its optimum state as a permanent magnet. The composi- 
tion of Alnico V is, by weight percent, 8 Al, 14 Ni, 24 Co, 3 Cu, 51 Fe. As a permanent magnet it is 
a two-phase system, with fine particles of one phase embedded in the other phase. The precipita- 
tion is carried out in a magnetic field, and the particles are oriented with their long axis parallel to 
the field direction. The width shown is 1.1 fim. (Courtesy of F. E. Luborsky.) 


Soft magnetic materials are used to concentrate and shape magnetic flux, 
as in motors, generators, transformers, and sensors. Useful soft materials in- 
clude electrical steels (usually alloyed with several percent of silicon to in- 
crease electrical resistivity and to decrease anisotropy); various alloys of 
Fe-Co-Mn, starting with permalloys of composition near Ni 78 Fe 2 2 , which have 
near-zero anisotropy energy and near-zero magnetostriction; NiZn and MnZn 
ferrites; and metallic glasses produced by rapid solidification. A commercial 
metallic glass (METGLAS 2605S-2) with composition Fe 79 B 13 Sig has a hys- 
teresis loss per cycle much lower than the best grain-oriented silicon steel. 

The high coercivity of materials composed of very small grains or fine 
powders is well understood. A sufficiently small particle, with diameter less 
than 10~ 5 or 10~ 6 cm, is always magnetized to saturation as a single domain be- 
cause the formation of a flux-closure configuration is energetically unfavorable. 
In a single domain particle it is not possible for magnetization reversal to take 
place by means of the process of boundary displacement, which usually re- 
quires relatively weak fields. Instead the magnetization of the particle must 
rotate as a whole, a process that may require large fields depending on 
the anisotropy energy of the material and the anisotropy of the shape of the 
particle. 

The coercivity of fine iron particles is expected theoretically to be about 
500 gauss on the basis of rotation opposed by the crystalline anisotropy energy, 
and this is of the order of the observed value. Higher coercivities have been 
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reported for elongated iron particles, the rotation here being opposed by the 
shape anisotropy of the demagnetization energy. 

Rare earth metals in alloys with Mn, Fe, Co, and Ni have very large crystal 
anisotropies K and correspondingly large coercivities, of the order of 2 KJM. 
These alloys are exceptionally good permanent magnets. For example, the 
hexagonal compound SmCo 5 has an anisotropy energy 1.1 X 10 8 erg cm 3 , 
equivalent to a coercivity 2 KJM of 290 kG (29 T). Magnets of Nd 2 Fe 14 B have 
energy products as high as 50 MGOe, exceeding all other commercially avail- 
able magnets. 


SINGLE-DOMAIN PARTICLES 

The dominant industrial and commercial applications of ferromagnetism 
are in magnetic recording devices, where the magnetic material is in the form 
of single-domain particles or regions. The total value of the production of 
magnetic devices for recording may he comparable with the total value of 
semiconductor device production and greatly exceeds the value of supercon- 
ducting device production, the latter being held back by low critical tempera- 
tures, as compared with magnetic Curie temperatures. The magnetic record- 
ing devices or memories typically are in the form of hard disks in computers 
and tape in video and audio recorders. 

An ideal single-domain particle is a fine particle, usually elongated, that 
has its magnetic moment directed toward one end or the other of the particle. 
The alternative orientations may be labeled as N or S; + or in digital 
recording, as 0 or 1. To have digital properties a ferromagnetic particle should 
be fine enough, typically 10-100 nm, so that only one domain is within the 
particle. If the fine particle is elongated (acicular) or has uniaxial crystal sym- 
metry, only two values of the magnetic moment of the single domain are per- 
mitted, which is what one wants for digital properties. The first successful 
recording material was acicular r-Fe 2 0 3 with length -to-width ratio of about 
5:1, coercivity near 200 Oe and a length <1 /am; chromium dioxide Or() 2 is 
the basis of a better material, in a form highly acicular (20 : 1) with coercivity' 
near 500 Oe. 

Effective elongation can be attained with spheres by making a chain, 
like a string of beads. An ensemble of such chains or of elongated single do- 
main particles is said to exhibit superparamagnetism if the magnetic mo- 
ment of a unit is constant. If /a is the magnetic moment in a magnetic field B, 
then the net magnetization of the ensemble will follow the Curie-Brillouin- 
Langevin law of Chapter 11 if the particles are embedded in a liquid so that 
they are each free to rotate as a whole. If the particles are frozen in a solid, 
there will be a remanent magnetization (Fig. 26) after removal of an applied 
field. 
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Geomagnetism and Biomagnetism 

Single domain ferromagnetic properties are of special geological interest 
in sedimentary rocks because the rocks through their remanent magnetization 
earn 7 a memory of the direction of the earth’s magnetic field at the time that 
they were laid down, and thus of the geographical location of the rocks at that 
epoch. The magnetic record is perhaps the most important basis of the theory 
of the drift of continents. Annually, layers of sediment are deposited in stream 
beds, layers that may bear some magnetic particles in single domain form. This 
record persists over at least 500 million years of geological time and can tell us 
where on the surface of the earth the deposit was laid down at a given time. 
Lava flows also record magnetic field directions. 

The change in magnetization from layer to layer gives a superb historical 
record of the drift of the continental plates on the earth’s surface. The paleo- 
magnetic record is one basis of the branch of geology called plate tectonics. 
The original interpretation of the record was made more difficult, or more ex- 
citing, by the associated discovery (Brunhes, 1906) that the magnetic field of 
the earth itself can show reversals in direction, an effect contained within the 
standard dynamo theory 7 of the earths magnetism. Reversals have taken place 
once every 7 1 X 10 4 to 25 X 10 fi years. When a reversal occurs, it is relatively 
sudden. 

Fine single domain particles, often of magnetite Fe 3 0 4 , are even of impor- 
tance in biology. A direction-seeking effect known as magnetotaxis often con- 
trols, possibly sometimes along with an astronomical guide system, the motion 
of bacteria, the migration of birds, and the movements of homing pigeons and 
bees. The effect is due to the interaction of a single domain particle (or cluster 
of such particles, Fig. 33) in the organism with the external magnetic field of 
the earth. 


Magnetic Force Microscopy 

The success of the scanning tunneling microscope (STM) stimulated the 
development of related scanning probe devices, of which the scanning mag- 
netic force microscope is one of the most effective. A sharp tip of a magnetic 
material, such as nickel, is mounted on a cantilever lever (Fig. 34). Ideally, but 
not yet, the tip is a single domain particle. Forces from the magnetic sample 
act on the tip and cause a change, such as a deflection, in the cantilever status, 
and an image is formed by scanning the sample relative to the tip. The mag- 
netic force microscope (MFM) is the only magnetic imaging technique that 
can provide high resolution (10-100 nm) with little surface preparation. One 
can, for example, observe and image the magnetic flux that exits from the 
surface at the intersection of a Bloch wall with the surface (Fig. 29). An impor- 
tant application is to the study 7 of magnetic recording media — Figure 35 shows 
the magnetic signal from a test pattern of 2 /am bits magnetized in the plane of 



Figure 33 Thin section of a cell of a magnetotactic bac- 
terium showing a chain of 50 nm particles of Fe. 3 0 4 . Draw- 
ing by Marta Puebla from a photograph by R. B. Frankel 
and others. 
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Figure 34 Basic concept of magnetic force microscopy. A magnetic tip attached to a flexible can- 
tilever is used to detect the magnetic field produced by the regions of alternating magnetization in 
the plane of the sample. (After Gruetter, Mamin, and Rugar, 1992.) 



Figure 35 Test strip magnetization in the plane of a Co-alloy disk in 2 (»m bits, as detected by 
MFM close above the plane of the disk. (After Rugar et al.) 
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a Co-alloy disk; the parallel component of the field seen by the sensor tip is 
what the photo shows. 


SUMMARY 

{In CGS Units) 

• The susceptibility of a ferromagnet above the Curie temperature has the 
form x — C/(T — T c ) in the mean field approximation. 

• In the mean field approximation the effective magnetic field seen by a mag- 
netic moment in a ferromagnet is B„ + AM, when A = T../C and B a is the 
applied magnetic field. 

• The elementary excitations in a ferromagnet are magnons. Their dispersion 
relation for ka 1 has the form heo ~ ]k 2 a 2 in zero external magnetic field. 
The thermal excitation of magnons leads at low temperatures to a heat ca- 
pacity and to a fractional magnetization change both proportional to T' V2 . 

• In an antiferromagnet two spin lattices are equal, but antiparallel. In a ferri- 
magnet two lattices are antiparallel, but the magnetic moment of one is 
larger than the magnetic moment of the other. 

• In an antiferromagnet the susceptibility above the Neel temperature has the 
form* = 2 C/(T + 6). 

• The magnon dispersion relation in an antiferromagnet has the form 
hco ~ Jka. The thermal excitation of magnons leads at low temperatures to a 
term in T 3 in the heat capacity, in addition to the phonon term in T 3 . 

• A Bloch wall separates domains magnetized in different directions. The 
thickness of a wall is ~(J/Kti 3 ) V2 lattice constants, and the energy per unit 
area is s =(KJ/a) m , where K is the anisotropy energy density. 


Problems 

1. Magnon dispersion relation. Derive the magnon dispersion relation (24) for a 
spin S on a simple cubic lattice, z = 6. Hint: Show first that (18a) is replaced by 

dSUdt = (2JS/h)(6S y p - 2 S* +s ) , 
a 

where the central atom is at p and the six nearest neighbors are connected to it by 
six vectors 5. Look for solutions of the equations for dS r p /dt and dS y p /dt of the form 
exp(ik • p — ia>t). 

2. Heat capacity of magnons. Use the approximate magnon dispersion relation 

oj = Ak 2 to find the leading term in the heat capacity of a three-dimensional ferro- 
magnet at low temperatures k B T < J. The result is 0.113 k B (k t T/hA) m , per unit 
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volume. The zeta function that enters the result may be estimated numerically; it is 
tabulated in Jahnke-Emde. 

3. Neel temperature. Taking the effective fields on the two-sublattice model of an 
antiferromagnetic as 


show that 


B a = B a - pU B - e\f A ; B„ = B a - pM A ~ eM l 


e _ h- + * 

T v M - e ' 


4. Magnetoelastic coupling. In a cubic crystal the elastic energy density' in terms of 
the usual strain components is (Chapter 3) 


u«i = iC„(4 


+ e?j,j + e:. 


+ lC u (e% + el, + e 2 . x ) + C n (e„^e. z + e xx e„ 


+ 


and the leading term in the magnetic anisotropy energy density is, from (54), 


T T T 7 / 9 2 I 0 0 | 9 9\ 

b K — kyiayat, + a 2 a 2 + crjaq) . 

Coupling between elastic strain and magnetization direction may be taken formally 
into account by including in the total energy density a term 

U c = B y {a\e xx + ale m + afej + B 2 (<XyCttf xy + a 2 a 3 e F + a^e.J 

arising from the strain dependence of U K \ here By and B 2 arc called magnetoelastic 
coupling constants. Show that the total energy is a minimum when 

_ By[C 12 — af(C n + 2 C 12 )] _ ^ 2 a t a j 

~ [(C u - C 12 )(C n + 2C 12 )J ; “ ~C^r 1 J> ' 

This explains the origin of magnetostriction, the change of length on magnetization. 

5. Coercive force of a small particle, (a) Consider a small spherical single-domain 
particle of a uniaxial ferromagnet. Show that the reverse field along the axis re- 
quired to reverse the magnetization is B a — 2KJM S , in CCS units. The coercive force 
of single-domain particles is observed to he of this magnitude. Take U K = K sin 2 6 as 
the anisotropy energy density and l! M = ~B a M cos 0 as the interaction energy den- 
sity with the external field; here 8 is the angle between B, and M. Hint: Expand the 
energies for small angles about 8 — it, and find the value of B„ for which I/ K + U M 
does not have a minimum near 9 = tt. (b) Show that the magnetic energy of a satu- 
rated sphere of diameter d is —Mfrf 3 . An arrangement with appreciably' less mag- 
netic energy has a single wall in an equatorial plane. The domain wall energy' will be 
TT(T t jI 2 !4, where a w is the wall energy per unit area. Estimate for cobalt the critical 
radius below which the particles are stable as single domains, taking the value of 
JS 2 /a as for iron. 

6. Saturation magnetization near T v . Show' that in the mean field approximation 
the saturation magnetization just below the Curie temperature has the dominant 
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temperature dependence (T c — T)' rl . Assume the spin is The result is the same as 
that for a second-order transition in a ferroelectric crystal, as discussed in Chapter 
16, The experimental data for ferromagnets (Table 1) suggest that the exponent is 
closer to 0.33. 

7. Neel wall. The direction of magnetization change in a domain wall goes from that 
of the Bloch wall to that of a Neel wall (Fig. 36) in thin films of material of negligi- 
ble crystalline anisotropy energy, such as Permalloy. The intercept of the Bloch wall 
with the surface of the film creates a surface region of high demagnetization energy. 
The Neel wall avoids this intercept contribution, but at the expense of a demagneti- 
zation contribution throughout the volume of the wall. The Neel wall becomes en- 
ergetically favorable when the film becomes sufficiently thin. Consider, however, 
the energetics of the Ndel wall in bulk material of negligible crystalline anisotropy 
energy. There is now a demagnetization contribution to the wall energy density. By 
a qualitative argument similar to (56), show that tT u .^‘(u 2 JS 2 /Na 2 ') + (2irM 2 Na). Find 
N for which er u , is a minimum. Estimate the order of magnitude of cr,,, for typical val- 
ues of J, M s , and a. 

8. Giant magnetoresistance. In a ferromagnetic metal, the conductivity cr p for elec- 
trons whose magnetic moments are oriented parallel to the magnetization is typi- 
cally larger than <r 0 for those antiparallel to the magnetization. Consider a ferromag- 
netic conductor consisting of two separate regions of identical dimensions in series 
whose magnetizations can be independently controlled. Electrons of a given spin 
flow first through one region and then through the other. It is observed that the re- 
sistance when both magnetizations point upwards, RfT. ' s lower than the resistance 
when they point opposite, Ktj.. This resistance change can be large for cr Ja a §=■ 1, 
and the phenomenon is called giant magnetoresistance (GMR). A small external 
magnetic field can switch the resistance from firi to Itf? by reorienting the magneti- 
zation of the second layer. This effect is increasingly used in magnetic storage appli- 
cations such as the magnetic bit readout in hard drives. The giant rnagnetnresis- 
tance ratio is defined as: 


CMRti = 


R U 
R t t 



Figure 36 A Bloch wall and a Neel wall in a thin film. The magnetization in the Bloch wall is nor- 
mal to the plane of the film and adds to the wall energy a demagnetization energy ~M$8d per unit 
length of wall, where S is the wall thickness and d the film thickness. In the Ndel wall the magneti- 
zation is parallel to the surface; the addition to the wall energy is negligible when d <t S. The addi- 
tion to the Neel wall energy when d S> 8 is the subject of Problem 7. (Alter S. Middelhock.) 
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(a) If there is no spin-flip scattering for the conduction electrons, show that 

GMRR = (ir/ir,, + trj<r p - 2)/4 

(Hint: Treat the spin-up and spin-down conduction electrons as independent con- 
ducting channels in parallel.) (b) If cr a — > 0, explain physically why the resistance in 
the T J- magnetization configuration is infinite. 
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NOTATION: In this chapter the symbols B a and B 0 refer to the applied field, 
and B, is the applied field plus the demagnetizing field. In particular we write 
— B q z. For CGS readers it may be simpler to read H for B whenever it 
occurs in this chapter. 
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Figure 1 Electron spin resonance absorption in MnS0 4 at 298 K at 2.75 GHz, after Zavoisky. 
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CHAPTER 13: MAGNETIC RESONANCE 


In this chapter we discuss dynamical magnetic effects associated with the spin 
angular momentum of nuclei and of electrons. The principal phenomena are 
often identified in the literature by their initial letters, such as 

NMR: nuclear magnetic resonance 
NQR: nuclear quadrupole resonance 

EPR or ESR: electron paramagnetic or spin resonance (Fig. 1) 

FMR: ferromagnetic resonance 

SWR: spin wave resonance (ferromagnetic films) 

AFMR: antiferromagnetic resonance 
CESR: conduction electron spin resonance 

The information that can be obtained about solids by resonance studies 
may be categorized: 

• Electronic structure of single defects, as revealed by the fine structure of 
the absorption. 

• Motion of the spin or of the surroundings, as revealed by changes in the line 
width. 

• Internal magnetic fields sampled by the spin, as revealed by the position of 
the resonance line (chemical shift; Knight shift). 

• Collective spin excitations. 

It is best to discuss NMR as a basis for a brief account of the other reso- 
nance experiments. A great impact of NMR has been in organic chemistry and 
biochemistry, where NMR provides a powerful tool for the identification and 
the structure determination of complex molecules. This success is due to the 
extremely high resolution attainable in diamagnetic liquids. A major medical 
application of NMR is magnetic resonance imaging (MRI), which allows the 
resolution in 3D of abnormal growths, configurations, and reactions in the 
whole body. 


NUCLEAR MAGNETIC RESONANCE 

We consider a nucleus that possesses a magnetic moment jj. and an angu- 
lar momentum hi. The two quantities are parallel, and we may write 

fl = yhl ; (1) 

the magnetogyric ratio y is constant. Ry convention I denotes the nuclear 
angular momentum measured in units of h. 
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Figure 2 Energy level splitting of a nucleus of spin I = § in a static magnetic field B 0 . 


The energy of interaction with the applied magnetic field is 


if B a = B 0 z, then 


U=-/l- B a ; 


U = — = —yhBJi, . 


( 2 ) 

(3) 


The allowed values of I z are = I, I — 1, . . . , —I, and U = —mjyfiB 0 . 

In a magnetic field a nucleus with I = f has two energy levels correspond- 
ing to = ± 2 , as in Fig. 2. If ha> 0 denotes the energy difference between the 
two levels, then h(o 0 = yhB 0 or 

« 0 = yB 0 . (4) 

This is the fundamental condition for magnetic resonance absorption. 

For the proton 1 y = 2.675 X 10 4 s _1 gauss -1 = 2.675 X 10 s s -1 tesla -1 , 
so that 


i'(MHz) = 4.258 B 0 (kilogauss) = 42.58 B 0 (tesla) , 


(4a) 


where v is the frequency. One tesla is precisely 10 4 gauss. Magnetic data for 
selected nuclei are given in Table 1. For the electron spin. 


^(GHz) = 2.80 B 0 (kilogauss) = 28.0 B 0 (tesla) . 


(4b) 


^he magnetic moment ii p of the proton is 1.4106 X 10 -23 erg G" 1 or 1.4106 X 10~ 26 J T* 1 , 
and y = 2fi p /h. The nuclear magneton fi n is defined as eh/2M p c and is equal to 5.0509 X 10 -24 erg 
G _1 or 5.0509 X 10~ 2 ‘ J T -1 ; thus fx, p = 2.793 nuclear magnetons. 





Table 1 Nuclear magnetic resonance data 


H 1 


1/2 

99.98 

2.792 


For every element the most abundant magnetic isotope is shown. After 
Varian Associates NMR Table. 

Li’ 

3/2 

92.57 

3.266 

Be’ 

3/2 

100. 

-1.177 

Most abundant isotope with nonzero nuclear spin 

Na 23 

Mg 25 

i 3/2 

5/2 

Nuclear spin, in units of 

100. 

10.05 

Natural abundance of isotope, in percent 

| 2.216 

0.855 

Nuclear magnetic moment, in units of eh/2M r c 


K 39 

Ca 43 

Sc 45 

fS9 

V 51 

Cr 53 

Mn 55 

Fe 57 

Co 59 

Ni 41 

Cu 43 

Zn 47 

| 3/2 

7/2' 

7/2 


7/2 

3/2 

5/2 

1/2 

7/2 

3/2 

3/2 

5/2 

93.08 

0.13 

100. 

Kill 

-100 

9.54 

100. 

2.245 

100. 

1.25 

69.09 

4.12 

0.391 

-1.315 

4.749 

lutia 

5,139 

0.474 

3.461 

0.090 

4.639 

0.746 

2.221 

0.874 




11— ififijgilil 









Rb 85 

Sr 87 

y89 

Zr 91 

Nb 93 

Mo 95 

Tc 

Ru 101 
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Equations of Motion 

The rate of change of angular momentum of a system is equal to the 
torque that acts on the system. The torque on a magnetic moment fi in a mag- 
netic field B is fi X B, so that we have the gyroscopic equation 

fuil/dt = fiXB a ; (5) 

or 

dpJdt = y/L X B a . (6) 

The nuclear magnetization M is the sum over all the nuclei in a unit vol- 
ume. If only a single isotope is important, we consider only a single value of y, 
so that 

dM/dt = yM X B fl . (7) 

We place the nuclei in a static field B a = B 0 z. In thermal equilibrium at 
temperature T the magnetization will be along z: 

AT = 0 ; M y = 0 ; M z ~ M 0 — = CB n /T , (8) 

where the susceptibility is \o and the Curie constant C = N/j, 2 /3k B , as in 
Chapter 11. 

The magnetization of a system of spins with I = § is related to the 
population difference N l — N 2 of the lower and upper levels in Fig. 2: 
AT = (A 7 , — N 2 )/x, where the N’s refer to a unit volume. The population ratio 
in thermal equilibrium is just given by the Boltzmann factor for the energy 
difference 2/zi> 0 : 

(N 2 /A/ 1 ) 0 = cxp(-2 M B 0 /fc B T) . (9) 

The equilibrium magnetization is M 0 = N/x tanh (/j,B/k R T). 

When the magnetization component AT is not in thermal equilibrium, we 
suppose that it approaches equilibrium at a rate proportional to the departure 
from the equilibrium value M 0 : 


dM „ M 0 - M, 

(10) 

In the standard notation T l is called the longitudinal relaxation time or the 
spin-lattice relaxation time. 

If at t — 0 an unmagnetized specimen is placed in a magnetic field B 0 z, 
the magnetization will increase from the initial value M z = 0 to a final value 
AT = M 0 . Before and just after the specimen is placed in the field, the popula- 
tion N 1 wall be equal to N 2 , as appropriate to thermal equilibrium in zero mag- 
netic field. It is necessary to reverse some spins to establish the new equilibrium 
distribution in the field B 0 . On integrating (10): 


dM. _ i f* 


(ID 
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t/T t — - 


Figure 3 At time t = 0 an unmagnetized specimen M.( 0) = 0 is placed in a static magnetic field 
B 0 . The magnetization increases with time and approaches the new equilibrium value M 0 = y n B (i . 
This experiment defines the longitudinal relaxation time T v The magnetic energy density — M • B 
decreases as part of the spin population moves into the lower level. The asymptotic value at t > Tj 
is —M 0 B 0 . The energy flows from the spin system to the system of lattice vibrations; thus 7\ is also 
called the spin-lattice relaxation time. 


or 


log Mo ^° M = jr ; M.(t) = M 0 [ 1 - expl-t/T^] , (12) 


as in Fig. 3. The magnetic energy — M • B a decreases as M. approaches its new 
equilibrium value. 

Typical processes whereby the magnetization approaches equilibrium are 
indicated in Fig. 4. The dominant spin-lattice interaction of paramagnetic 
ions in crystals is by the phonon modulation of the crystalline electric field. 
Relaxation proceeds by three principal processes (Fig. 4b): direct (emission or 
absorption of a phonon); Raman (scattering of a phonon); and Orbach (inter- 
vention of a third state). 

Taking account of (10), the z component of the equation of motion (7) 
becomes 


dM. 

dt 


y(M x B a ) z + 


M 0 -M z 

Ti 


(13a) 


where (M 0 — M.)/T 1 is an extra term in the equation of motion, arising from 
the spin-lattice interactions not included in (7). That is, besides processing 
about the magnetic field, M will relax to the equilibrium value M 0 . 

If in a static field /i 0 z the transverse magnetization component M x is not 
zero, then M x will decay to zero, and similarly for M tJ . The decay occurs 
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Insulator Metal 


Figure 4a Some important processes that contribute to longitudinal magnetization relaxation in 
an insulator and in a metal. For the insulator we show a phonon scattered inelastically by the spin 
system. The spin system moves to a lower energy state, and the emitted phonon has higher energy 
by hto 0 than the absorbed phonon. For the metal we show a similar inelastic scattering process in 
which a conduction electron is scattered. 



Figure 4b Spin relaxation from 2 — » 1 by phonon emission, phonon scattering, and a two-stage 
phonon process. The temperature dependence of the longitudinal relaxation time T 1 is shown for 
the several processes. 


because in thermal equilibrium the transverse components are zero. We can 
provide for transverse relaxation: 

dMJdt = y(M X B a ) x - M x /T 2 ; (13b) 

dMy/dt = y(M X B a \ - M y /T 2 , (13c) 

where T 2 is called the transverse relaxation time. 

The magnetic energy — M • B„ does not change as M x or M y changes, pro- 
vided that B„ is along z. No energy need flow out of the spin system during 
relaxation of M x or M tJ , so that the conditions that determine T 2 may be less 
strict than for T l . Sometimes the two times are nearly equal, and sometimes 
T 1 > T 2 , depending on local conditions. 

The time T 2 is a measure of the time during which the individual moments 
that contribute to M x , M y remain in phase with each other. Different local 
magnetic fields at the different spins will cause them to precess at different 
frequencies. If initially the spins have a common phase, the phases will be- 
come random in the course of time and the values of M x , M y will become zero. 
We can think of T 2 as a dephasing time. 
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To rf supply and circuit " 
for measuring inductance 
and losses. 



rf coil 



Figure 5 Schematic arrangement for magnetic resonance experiments. 


The set of equations (13) are called the Bloch equations. They are not 
symmetrical in x, y, and z because we have biased the system with a static mag- 
netic field along z. In experiments an rf magnetic field is usually applied along 
the x or y axis. Our main interest is in the behavior of the magnetization in the 
combined rf and static fields, as in Fig. 5. The Bloch equations are plausible, 
but not exact; they do not describe all spin phenomena, particularly not those 
in solids. 

We determine the frequency of free precession of the spin system in a 
static field B„ = B 0 z and with M. = M 0 . The Bloch equations reduce to 


dM x 

dt 


M x 

- yBoM y - y ; 


dM y 

dt 


M y 

~yB 0 M x - -=r ; 

1 2 



We look for damped oscillatory solutions of the form 


(14) 


M x — m exp {—t/T) cos cot ; M y = -m exp(- t/T') sin cot . (15) 

On substitution in (14) we have for the left-hand equation 

—w sin cot — cos cot = — y B 0 sin cot — cos cot , (16) 

i 1 2 

so that the free precession is characterized by 

co 0 = yB 0 ; T' = T 2 . (17) 

The motion (15) is similar to that of a damped harmonic oscillator in two 
dimensions. The analogy suggests correctly that the spin system will show res- 
onance absorption of energy from a driving field near the frequency co 0 = yB 0 , 
and the frequency width of the response of the system to the driving field will 
be A co ~ 1/T 2 . Figure 6 shows the resonance of protons in water. 

The Bloch equations may be solved to give the power absorption from a 
rotating magnetic field of amplitude Bp 

B x = Bj cos cot ; B y = — Bj sin cot . 


(18) 
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Figure 6 Proton resonance absorption in water. 
(E. L. Ilahn.) 



After a routine calculation one finds that the power absorption is 


(CGS) 




toyMJs 

1 + («o “ ufTl 


B 


2 

l * 


The half-width of the resonance at half-maximum power is 

(Aw)y 2 = l/Tg • 


(19) 


( 20 ) 


LINE WIDTH 


The magnetic dipolar interaction is usually the most important cause of 
line broadening in a rigid lattice of magnetic dipoles. The magnetic field AB 
seen by a magnetic dipole fa due to a magnetic dipole fa at a point r 12 from 
the first dipole is 


(CGS) 


AB = 


3(fa ' 


r 12/ r 12 


■ fhnz 


rjo 


( 21 ) 


by a fundamental result of magnetostatics. 

The order of magnitude of the interaction is, with B, written for A B, 

(CGS) Bi’Vfi/r 3 . (22) 

The strong dependence on r suggests that close-neighbor interactions -will be 
dominant, so that 


(CGS) 


B j ~ jx/a " , 


(23) 
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where a is the separation of nearest neighbors. This result gives us a measure 
of the width of the spin resonance line, assuming random orientation of the 
neighbors. For protons at 2A separation, 


r „ 14 X 10~ 23 G cm 3 
‘ 8 X 10~ 24 cm 3 


= 2 gauss = 2 X 10 4 tesla . 


(24) 


To express (21 ), (22), and (23) in SI, multiply the right-hand sides by ^Att. 

Motional Narrowing 

The line width decreases for nuclei in rapid relative motion. The effect in 
solids is illustrated by Fig. 7: diffusion resembles a random walk as atoms jump 
from one crystal site to another. An atom remains in one site for an average 
time t that decreases markedly as the temperature increases. 

The motional effects on the line width arc even more spectacular in nor- 
mal liquids, because the molecules arc highly mobile. The width of the proton 
resonance line in water is only 1 (I 5 of the width expected for water molecules 
frozen in position. 

The effect of nuclear motion on T 2 and on the line width is subtle, but can 
be understood by an elementary argument. We know from the Bloch equa- 
tions that T, is a measure of the time in which an individual spin becomes 
dephased by one radian because of a local perturbation in the magnetic field 



Figure 7 Effect of diffusion of nuclei on the Li' NMR line width in metallic lithium. At low tem- 
peratures the width agrees with the theoretical value for a rigid lattice. As the temperature in- 
creases, the diffusion rate increases and the line width decreases. The abrupt decrease in line 
width above T = 230 K occurs when the diffusion hopping time r becomes shorter than 1/y 
Thus the experiment gives a direct measure ol the hopping time for an atom to change lattice 
sites. (After H. S. Gutowsky and B. R. McGarvey ) 
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Figure 8 Phase of a spin in a constant local field, as compared with dephasing of a spin which 
after fixed time intervals r hops at random among sites having local fields ± 1. 


intensity. Let (Aa>)o ~ yB. denote the local frequency deviation due to a 
perturbation B;. The local field may be caused by dipolar interactions with 
other spins. 

If the atoms are in rapid relative motion, the local field B, seen by a given 
spin will fluctuate rapidly in time. We suppose that the local field has a value 
+B t for art average time r and then changes to — B,-, as in Fig. 8a. Such a ran- 
dom change could be caused by a change of the angle between ft and r in (21). 
In the time r the spin will precess by an extra phase angle Sep = ± yBj relative 
to the phase angle of the steady precession in the applied field B 0 . 

The motional narrowing effect arises for short t such that Sep < 1. After n 
intervals of duration r the mean square dephasing angle in the field B 0 will be 

(i p 2 ) — n(8ep ) 2 = , (25) 

by analogy with a random walk process: the mean square displacement from the 
initial position after n steps of length t in random directions is (r 2 ) = n( : \ 

The average number of steps necessary to dephase a spin by one radian is 
n ~ ]/y' 1 B 2 i T 2 . (Spins dephased by much more than one radian do not con- 
tribute to the absorption signal.) This number of steps takes place in a time 

T 2 = nr - l/y 2 B?r , (26) 
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quite different from the rigid lattice result T 2 = l/'y/3 1 . From (26) we obtain as 
the line width for rapid motion with a characteristic time r: 


A w - 1 /T 2 = (y B s ) 2 * * t , 

(27) 

Aw - 1/T 2 = (Aft)) 2 r , 

(28) 


where { Aw) 0 is the line width in the rigid lattice. 

The argument assumes that (Aw) 0 t 1, as otherwise S(p will not be < 1. 
Thus Aft, <g (Ata) 0 . The shorter is r, the narrower is the resonance line! This 
remarkable effect is known as motional narrowing . 2 The rotational relax- 
ation time of water molecules at room temperature is known from dielectric 
constant measurements to be of the order of 10~ 10 s; if (Aw) 0 555 10 r> s~’, then 
(Ao>) 0 t ~ 10 5 * * and A co ~ (Ao)) 2 t ~ 1 s -i . Thus the motion narrows the proton 
resonance line to about 10 -3 of the static width. 


HYPERFINE SPLITTING 

The hyperfine interaction is the magnetic interaction between the mag- 
netic moment of a nucleus and the magnetic moment of an electron. To an 
observer stationed on the nucleus, the interaction is caused by the magnetic 
field produced by the magnetic moment of the electron and by the motion of 
the electron about the nucleus. There is an electron current about the nucleus 
if the electron is in a state with orbital angular momentum about the nucleus. 
But even if the electron is in a state of zero orbital angular momentum, there 
is an electron spin current about the nucleus, and this current gives rise to the 
contact hyperfine interaction, of particular importance in solids. We can 
understand the origin of the contact interaction by a qualitative physical argu- 
ment, given in CGS. 

The results of the Dirac theory of the electron suggest that the magnetic 
moment of /j.„ — eh/'ime of the electron arises from the circulation of an elec- 
tron with velocity c in a current loop of radius approximately the electron 
Compton wavelength, K e = h/mc ~ 10 _n cm. The electric current associated 
with the circulation is 

I ~ e X (turns per unit time) ~ ad\ e , (29) 


2 The physical ideas are due to N. Bloembergen, E. M. Purcell, and R. V. Pound, Phys. Rev. 

73, 679 (1948). The result differs from the theory of optical line width caused by strong collisions 

between atoms (as in a gas discharge), where a short t gives a broad line. In the nuclear spin prob- 

lem the collisions are weak. In most optical problems the collisions of atoms are strong enough to 

interrupt the phase of the oscillation. In nuclear resonance the phase may vary smoothly in a 

collision, although the frequency may vary suddenly from one value to another nearby value. 
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Figure 9 Magnetic field B produced by a charge moving in a circular loop. The contact part of 
the hyperfine interaction with a nuclear magnetic moment arises from the region within or near to 
the current loop. The field averaged over a spherical shell that encloses the loop is zero Thus for 
an s electron (L = 0) only the contact part contributes to the interaction. 


and the magnetic field (Fig. 9) produced by the current is 

(CCS) B ~ 7/V ~ dX-e ■ (30) 

The observer on the nucleus has the probability 

P = \>p(0) | 2 X 2 (31) 

of finding himself inside the electron, that is, within a sphere of volume X) 
about the electron. Here is the value of the electron wavefunction at the 
nucleus. Thus the average value of the magnetic field seen by the nucleus is 

B , (32) 

where pi B = eh/2mc — \e\ e is the Bohr magneton. 

The contact part of the hyperfine interaction energy' is 

U = • B = -fi, • fi B |i/r(0) | 2 « yh^ jt/r(0) | 2 I ■ S , (33) 

where l is the nuclear spin in units of h. The contact interaction in an atom has 
the form 


U = al • S . 


(34) 
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Figure 10 Energy levels in a magnetic field of a system with S = I = j. The diagram is drawn 
for the strong field approximation /j. b B > a, where a is the hyperfine coupling constant, taken to 
be positive. The four levels are labeled by the magnetic quantum numbers m s , m ; . The strong 
electronic transitions have Am, = 0, A m s = ±1. 

Values of the hyperfine constant a for the ground states of several free 
atoms are: 


nucleus 

H 1 

Li 7 

Na 23 

K 39 

K 41 

j 

1 

3 

3 

3 

3 

i 

2 

2 

2 

2 

2 

a in gauss 

507 

144 

310 

83 

85 

a in MIIz 

1420 

402 

886 

231 

127 


In a strong magnetic field the energy level scheme of a free atom or ion is 
dominated by the Zeeman energy splitting of the electron levels; the hyperfine 
interaction gives an additional splitting that in strong fields is U' = am s m : , 
where m s , m, are the magnetic quantum numbers. 

For the energy level diagram of Fig. 10 the two electronic transitions have 
the selection rules A m s = ± 1, Am, = 0, the frequencies are to = yll 0 ± a/2h. The 
nuclear transitions are not marked; they have A ?n s = 0, so that <w nuc = a/2h. The 
frequency of the nuclear transition 1 — » 2 is equal to that of 3 — > 4. 

The hyperfine interaction in a magnetic atom may split the ground energy 
level. The splitting in hydrogen is 1420 MHz; this is the radio frequency line of 
interstellar atomic hydrogen. 

Examples: Paramagnetic Point Defects 

The hyperfine splitting of the electron spin resonance furnishes valuable 
structural information about paramagnetic point defects, such as the F centers 
in alkali halide crystals and the donor impurity atoms in semiconductor crystals. 


Figure 11 An F center is a negative ion vacancy with one 
excess electron bound at the vacancy. The distribution of the 
excess electron is largely on the positive metal ions adjacent 
to the vacant lattice site. 

F Centers in Alkali Halides. An F center is a negative ion vacancy with 
one excess electron bound at the vacancy (Fig. 11). The wavefunction of the 
trapped electron is shared chiefly among the six alkali ions adjacent to the 
vacant lattice site, with smaller amplitudes on the 12 halide ions that form 
the shell of second nearest neighbors. The counting applies to crystals with the 
NaCl structure. If cp(r) is the wavefunction of the valence electron on a single 
alkali ion, then in the first (or LCAO) approximation 

</Kr) = C ^ <p(r ~ r p ) , (35) 

v 

where in the NaCl structure the six values of r p mark the alkali ion sites that 
bound the lattice vacancy. 

The width of the electron spin resonance line of an F center is determined 
essentially by the hyperfine interaction of the trapped electron with the nu- 
clear magnetic moments of the alkali ions adjacent to the vacant lattice site. 
The observed line width is evidence for the simple picture of the wavefunction 
of the electron. By line width we mean the width of the envelope of the possi- 
ble hyperfine structure components. 

As an example, consider an F center in KC1. Natural potassium is 93 per- 
cent K 39 with nuclear spin I = §. The total spin of the six potassium nuclei at 
the F center is I max = 6 X § = 9, so that the number of hyperfine components 
is 2 I max + 1 = 19; this is the number of possible values of the quantum number 
m,. There are (2 1 + l) 6 = 4 6 = 4096 independent arrangements of the six 
spins distributed into the 19 components, as in Fig. 12. Often we observe only 
the envelope of the absorption line of an F center. 

Donor Atoms in Silicon. Phosphorus is a donor when present in silicon. 
Each donor atom has five outer electrons, of which four enter diamagnetically 
into the covalent bond network of the crystal, and the fifth acts as a paramag- 
netic center of spin S = |. The experimental hyperfine splitting in the strong 
field limit is shown in Fig. 13. 

When the concentration exceeds about 1 X 10 18 donors cm -3 , the split line 
is replaced by a single narrow line. This is a motional narrowing effect (Eq. 28) 
of the rapid hopping of the donor electrons among many donor atoms. The 
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Quantum number 


Figure 12 The 4096 arrangements of the six nuclear spins of K 39 as distributed into 19 hyperfine 
components. Each component will be split further into a very large number of components by 
virtue of the residual hyperfine interaction with the 12 neighboring Cl nuclei, which may be Cl 35 
(75 percent) or Cl 3, (25 percent). The envelope of the pattern is approximately gaussian in form. 


Phosphorus (6 x 10 18 cnT 3 ) 



Phosphorus (1 x 10 17 cm -3 ); I = 1/2 


magnetic 

field 


Figure 13 Electron spin resonance lines of P donor atoms in silicon. At the higher donor con- 
centration near the metal-insulator transition, a donor electron can hop from site to site so rapidly 
that the hyperfine structure is suppressed. (After R. C. Fletcher, W. A. Yager, G. L. Pearson, and 
F. R. Merritt.) 


rapid hopping averages out the hyperfine splitting. The hopping rate increases 
at the higher concentrations as the overlap of the donor electron wavefunctions 
is increased, a view supported by conductivity measurements (Chapter 14). 

Knight Shift 

At a fixed frequency the resonance of a nuclear spin is observed at a 
slightly different magnetic field in a metal than in a diamagnetic solid. The 
effect is known as the Knight shift or metallic shift and is valuable as a tool 
for the study of conduction electrons. 
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The interaction energy of a nucleus of spin I and magnetogyric ratio 

y, is 


U = (- yi m 0 + a(S z ))L , (36) 

where the first term is the interaction with the applied magnetic field B n and 
the second is the average hyperfine interaction of the nucleus with the con- 
duction electrons. The average conduction electron spin (S z ) is related to the 
Pauli spin susceptibility y.. of the conduction electrons: M, — gNfi B (S.) = y s B 0 , 
whence the interaction may be written as 

v - + 0k) B ° ~ ( : + f) ■ <37) 

The Knight shift is defined as 


A B = a Xs 

B n 


(38) 


and simulates a fractional change in the magnetogyric ratio. By the definition 
(34) of the hyperfine contact energy, the Knight shift is given approximately by 
K ^ s l^(0)l 2 /N; that is, by the Pauli spin susceptibility increased in the ratio of 
the conduction electron concentration at the nucleus to the average conduc- 
tion electron concentration. 

Experimental values are given in Table 2. The value of the hyperfine cou- 
pling constant a is somewhat different in the metal than in the free atom be- 
cause the wave functions at the nucleus are different. From the Knight shift of 
metallic Li it is deduced that the value of |i^(0)| 2 in the metal is 0.44 of the 
value in the free atom; a calculated value of the ratio using theoretical wave 
functions is 0.49. 


Table 2 Knight shifts in NMR in metallic elements 


(At room temperature) 


Nucleus 

Knight shift 
in percent 

Nucleus 

Knight shift 
in percent 

Li 7 

0.0261 

Cu S3 

0.237 

Na 23 

0.112 

Rb 87 

0.653 

Al 27 

0.162 

Pd 105 

-3.0 

K 39 

0.265 

pf.195 

-3.533 

y51 

0.580 

Au 197 

1.4 

Cr 53 

0.69 

Pb 207 

1.47 
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0 0 

(a) (b) (c) 

Figure 14 (a) Lowest-energy orientation of a nuclear electric quadrupole moment (Q > 0) in the 
local electric field of the four ions shown. The electrons of the ion itself are not shown, (b) High- 
est energy orientation, (c) The energy level splitting for 1=1. 

NUCLEAR QUADRUPOLE RESONANCE 

Nuclei of spin J 5 1 have an electric quadrupole moment. The quadrupole 
moment Q is a measure of the ellipticity of the distribution of charge in the 
nucleus. The quantity of interest is defined classically by 

eQ = if (3s 2 - r 2 )p( r)d 3 x , (39) 

where p( r) is the charge density. An egg-shaped nucleus has Q positive; a 
saucer-shaped nucleus has Q negative. The nucleus when placed in a crystal 
will see the electrostatic field of its environment, as in Fig. 14. If the symmetry 
of this field is lower than cubic, then the nuclear quadrupole moment will lead 
to a set of energy levels split by the interaction of the quadrupole moment 
with the local electric field. 

The states that are split are the 21 + 1 states of a spin I. The quadrupole 
splittings can often be observed directly because an rf magnetic field of the 
appropriate frequency can cause transitions between the levels. The term 
nuclear quadrupole resonance refers to observations of nuclear quadrupole 
splittings in the absence of a static magnetic field. The quadrupole splittings 
are particularly large in covalently bonded molecules such as Cl 2 , Br 2 , and I 2 ; 
the splittings are of the order 10 7 or 10 s Hz. 


FERROMAGNETIC RESONANCE 

Spin resonance at microwave frequencies in ferromagnets is similar in 
principle to nuclear spin resonance. The total electron magnetic moment of 
the specimen precesses about the direction of the static magnetic field, and 
energy is absorbed strongly from the rf transverse field when its frequency is 
equal to the precessional frequency. We may think of the macroscopic vector S 
representing the total spin of the ferromagnet as quantized in the static mag- 
netic field, with energy levels separated by the usual Zeeman frequencies; the 
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magnetic selection rule A m s — ±1 allows transitions only between adjacent 
levels. 

The unusual features of ferromagnetic resonance include: 

• The transverse susceptibility components x and X are very large because 
the magnetization of a ferromagnet in a given static field is very much lar- 
ger than the magnetization of electronic or nuclear paramagnets in the same 
field. 

• The shape of the specimen plays an important role. Because the magnetiza- 
tion is large, the demagnetization field is large. 

• The strong exchange coupling between the ferromagnetic electrons tends to 
suppress the dipolar contribution to the line width, so that the ferromag- 
netic resonance lines can be quite sharp (<1 G) under favorable conditions. 

• Saturation effects occur at low rf power levels. It is not possible, as it is with 
nuclear spin systems, to drive a ferromagnetic spin system so hard that the 
magnetization AC is reduced to zero or reversed. The ferromagnetic reso- 
nance excitation breaks down into spin wave modes before the magnetiza- 
tion vector can be rotated appreciably from its initial direction. 

Shape Effects in FMR 

We treat the effects of specimen shape on the resonance frequency. Con- 
sider a specimen of a cubic ferromagnetic insulator in the form of an ellipsoid 
with principal axes parallel to x, y, z axes of a cartesian coordinate system. The 
demagnetization factors N ir N y , N z are identical with the depolarization fac- 
tors to be defined in Chapter 16. The components of the internal magnetic 
field Bj in the ellipsoid arc related to the applied field by 

B\ - B° - N X M X ; — NyM y ; Bt = B°- h\M z . 

The Lorentz field (4 tt/ 3)M and the exchange field AM do not contribute to the 
torque because their vector product with M vanishes identically. In SI we re- 
place the components of M by ja n M, with the appropriate redefinition of the N’s. 

The components of the spin equation of motion M = y(M X B ) become, 
for an applied static field B 0 z, 

= y(M y Bl - M.B' y ) = y[B () + (N y - N.)M]M y ; (40) 

dM 

= y\M(-N x M z ) - MJB U - N,M)] = -y[B 0 + (N x - N Z )M]M X . 

To first order we may set dMJdt = 0 and AC = M. Solutions of (40) with 
time dependence cxp(— iwt) exist if 

ia> y[B () + (Ny - NfM] 

-y[B 0 + (N x - N Z )M] iw 


= 0 , 
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Figure 15 FMR in a polished sphere of the 
ferromagnet yttrium iron garnet at 3.33 GHz 
and 300 K for B 0 || [111]. The total line 
width at half-power is only 0.2 G. (After 
R. C. LeCraw and E. Spencer.) 


so that the ferromagnetic resonance frequency in the applied field B 0 is 

(CGS) = /[Bq + (N y - N,)M][B 0 + ( N x - N Z )M] ; (41) 

(SI) o>l = /[B 0 + (N y - N Z )MB 0 + (N x - MJmoM] . 

The frequency w 0 is called the frequency of the uniform mode, in distinction 
to the frequencies of magnon and other nonuniform modes. In the uniform 
mode all the moments precess together in phase with the same amplitude. 

For a sphere N x = N y = AL so that <w 0 = yB 0 . A very sharp resonance 
line in this geometry is shown in Fig. 15. For a flat plate with B 0 perpendicular 
to the plate N x = N tJ = 0; N, = 4-77, whence the ferromagnetic resonance 
frequency is 

(CGS) oj 0 = y(B n - 4 ttM) ; (SI) = y(B 0 ~ HcM) . (42) 

If B 0 is parallel to the plane of the plate, the xz plane, then N x = N z = 0; N y = 
47 r, and 

(SI) Wo^/BotBo + MoM)] 172 . 


(CGS) co 0 = y[B 0 (B 0 + 4t iM)] m ■ 


(43) 
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Figure 16 Spin wave resonance in a thin film. The plane of the film is normal to the applied mag- 
netic field B 0 . A cross section of the film is shown here. The internal magnetic field is B 0 — 4itM. 
The spins on the surfaces of the film are assumed to be held fixed in direction by surface anisotropy 
forces. A uniform rf field will excite spin wave modes having an odd number of half-wavelengths. 
The wave shown is for n = 3 half-wavelengths. 


The experiments determine y, which is related to the spectroscopic split- 
ting factor g by — y = gyx B /ft. Values of g for metallic Fe, Co, Ni at room tem- 
perature are 2.10, 2.18, and 2.21, respectively. 

Spin Wave Resonance 

Uniform rf magnetic fields can excite long-wavelength spin waves in thin 
ferromagnetic films if the electron spins on the surfaces of the film see differ- 
ent anisotropy fields than the spins within the films. In effect, the surface 
spins may be pinned by surface anisotropy interactions, as shown in Fig. 16. 
If the rf field is uniform, it can excite waves with an odd number of half- 
wavelengths within the thickness of the film. Waves with an even number of 
half-wavelengths have no net interaction energy with the field. 

The condition for spin wave resonance (SWR) with the applied mag- 
netic field normal to the film is obtained from (42) by adding to the right-hand 
side the exchange contribution to the frequency. The exchange contribution 
may be written as Dk 2 , where D is the spin wave exchange constant. The as- 
sumption ka < 1 is valid for the SWR experiments. Thus in an applied field B 0 
the spin wave resonance frequencies are: 

(CGS) w 0 = y (B„ - 4t tM) + Dk 2 = y (B„ - 4t tM) + D(mr/L) 2 , (44) 

where the wavevector for a mode of n half-wavelengths in a film of thickness L 
is k = nv/L. An experimental spectrum is shown in Fig. 17. 
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Spin wave order number 


17 15 13 11 9 7 5 



Figure 17 Spin wave resonance spectrum in a Permalloy (80Ni20Fe) film at 9 GHz. The order 
number is the number of half-wavelengths in the thickness of the film. (After R, Weber.) 


ANTIFERROMAGNETIC RESONANCE 

We consider a uniaxial antiferromagnet with spins on two sublattices, 1 and 
2. We suppose that the magnetization Mj on sublattice 1 is directed along the 
+z direction by an anisotropy field B A z; the anisotropy field {Chapter 12) results 
from an anisotropy energy density l/ K (#i) = K. sin 2 Here 0 } is the angle be- 
tween Mj and the z axis, whence B A = 2 KJM, with M = |Mj = |M 2 |. The mag- 
netization M 2 is directed along the — z direction by an anisotropy field —B A z. If 
+z is an easy direction of magnetization, so is — z, If one sublattice is directed 
along +z, the other wall be directed along — z. 

The exchange interaction between Mj and M 2 is treated in the mean field 
approximation. The exchange fields are 

Bj(ex) = — AM 2 ; B 2 (ex) = — AMj , (45) 

where A is positive. Here B, is the field that acts on the spins of sublattice 1, 
and Bn acts on sublattice 2. In the absence of an external magnetic field the 
total field acting on Nl, is B, = — AM 2 + B A z; the total field on M 2 is 
B 2 = -*AM, — B 4 z, as in Fig. 18. 
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Figure 18 Effective fields in antiferromagnetic resonance. 
The magnetization M, of sublattice 1 sees a field — AM 2 + 
B a Z; the magnetization M 2 sees —A Nl, — B A z. Both ends of 
the crystal axis are "easy axes" of magnetization. 



In what follows we set M\ = M; Mi = — M. The linearized equations of 
motion are 

dM\/dt = y[M\(\M + B A ) - M(-AM£)] ; 

dM{/dt = - M\(\M + B a )] ; (46) 

dMydt = y[M£(-AM - B A ) - (-M)(-AM£)] ; 

dM'ydt = -y[(— M)(~AM*) - M£(-AM - B A ) 1 . (47) 

We define Mj = MJ + iM{\ M 2 — M| + iM\. Then (46) and (47) become, 
for time dependence exp( — iwt), 

-iwMl = -iy[Mt{B A + AM) + Mj (AM)] ; 

-iwMt = iy\M% (B a + AM) + M X + (AM)] . 

These equations have a solution if, with the exchange field B E = AM, 

yiB A + B f ) - a) y B e _ 0 

yB F y(B A + B E ) + w 

Thus the antiferromagnetic resonance frequency is given by 

*>o = T 2 B a (B a + 2 B k ) . (46) 

MnF 2 is an extensively studied antiferromagnet. The structure is shown in 
Fig. 19. The observed variation of w 0 with temperature is shown in Fig. 20. 
Careful estimates were made by Keffer of B A and B F for MnF 2 . He estimated 
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Figure 19 Chemical and magnetic structure of MnF 2 . 
The arrows indicate the direction and arrangement of the 
magnetic moments assigned to the manganese atoms. 



Figure 20 Antiferromagnetic resonance frequency for MnF 2 versus temperature. (After Johnson 
and Nethercot.) 


B e = 540 kG and B A = 8.8 kG at 0 K, whence (2 B A B E ) m = 100 kG. The 
observed value is 93 kG. 

Richards has made a compilation of AFMR frequencies as extrapolated 
toO K: 


CoF 2 

85.5 


NiF 2 

93.3 


FeF 2 

158 . 


Crystal 

Frequency in 10 10 Hz 


MnF 2 

26.0 


MnO NiO 
82.8 109 
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ELECTRON PARAMAGNETIC RESONANCE 
Exchange Narrowing 

We consider a paramagnet with an exchange interaction ] among nearest- 
neighbor electron spins. The temperature is assumed to be well above any 
spin-ordering temperature T c . Under these conditions the width of the spin 
resonance line is usually much narrower than expected for the dipole-dipole 
interaction. The effect is called exchange narrowing; there is a close analog)' 
with motional narrowing. We interpret the exchange frequency w cx ~ J/h as a 
hopping frequency 1/r. Then by generalization of the motional-narrowing 
result (28) we have for the width of the exchange-narrowed line: 

Aw = (Aw)o/w ex , (49) 

where (Aw) 2 0 = y^B 2 ) is the square of the static dipolar width in the absence 
of exchange. 

A useful and striking example of exchange narrowing is the paramagnetic 
organic crystal known as the g marker or DPPH, diphenyl picryl hydrazyl, often 
used for magnetic field calibration. This free radical has a 1.35 G half-width of 
the resonance line at half-power, only a few percent of the pure dipole width. 

Zero-Field Splitting 

A number of paramagnetic ions have crystal field splitti ngs of their magnetic 
ground state energy levels in the range of 10 10 — 10 11 Hz, conveniently accessi- 
ble by microwave techniques. The Mn 2+ ion has been studied in many crystals as 
an additive impurity. A ground state splitting in the range 10' — 10 9 Hz is 
observed, according to the environment. 


PRINCIPLE OF MASER ACTION 


Crystals can be used as microwave and light amplifiers and as sources of 
coherent radiation. A maser amplifies microwaves by the stimulated emission 
of radiation; a laser amplifies light by the same method. The principle, due to 
Townes, may be understood from the two-level magnetic system of Fig. 21 rel- 
evant for masers. There are n u atoms in the upper state and n ( atoms in the 
lower state. We immerse the system in radiation at frequency w; the amplitude 
of the magnetic component of the radiation field is B rf . The probability per 
atom per unit time of a transition between the upper and lower states is 


V = 


i 

fi / A w 


(50) 


here jjl is the magnetic moment, and Aw is the combined width of the two 
levels. The result (50) is from a standard result of quantum mechanics, called 
Fermi’s golden rule. 
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Figure 21 A two-level system, to explain maser op- 
eration. The populations of the upper and lower 
states are n„ and n ; , respectively. The frequency of 
the emitted radiation is to; the combined width of the 
states is Aw = Aw„ + Aw,. 


The net energy emitted from atoms in both upper and lower states is 


9 = 



A to 


hto • (n„ - n,) , 


(51) 


per unit time. Here 9 s denotes the power out; hco is the energy per photon; 
and n„ — n, is the excess of the number of atoms n u initially able to emit a pho- 
ton over the number of atoms n ( able to absorb a photon. 

In thermal equilibrium n„ < so there is no net emission of radiation, 
but in a nonequilibrium condition with n u > rii there will be emission. If we 
start with n u > n t and reflect the emitted radiation back onto the system, we 
increase B rf and thereby stimulate a higher rate of emission. The enhanced 
stimulation continues until the population in the upper state decreases and 
becomes equal to the population in the lower state. 

We can build up the intensity of the radiation field by placing the crystal 
in an electromagnetic cavity. This is like multiple reflection from the walls of 
the cavity. There will be some power loss in the walls of the cavity: the rate of 
power loss is 

(CCS) = (SI) (52) 

where V is the volume and Q is the Q factor of the cavity. We understand B 2 r( 
to be a volume average. 

The condition for maser action is that the emitted power 9? exceed the 
power loss 2P L . Both quantities involve B%, The maser condition can now be ex- 
pressed in terms of the population excess in the upper state: 


(CGS) 


n u — n i> 


VAB 
Btt/jlQ ’ 



n u ~n,> 


VAB 
2/XoM(? ’ 


(53) 


where /a is the magnetic moment. The line width A B is defined in terms of 
the combined line width Aw of the upper and lower states as [iAB = hAco. The 
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Figure 22 Three-level maser system. Two possible modes of operation are shown, starting from 
rf saturation of the states 3 and 1 to obtain n 3 = rp. 


central problem of the maser or laser is to obtain a suitable excess population 
in the upper state. This is accomplished in various ways in various devices. 

Three-Level Maser 

The three-level maser system (Fig. 22) is a clever solution to the excess 
population problem. Such a system may derive its energy levels from magnetic 
ions in a crystal, as Bloembergen showed. Rf power is applied at the pump 
frequency ha> p = E 3 — E : in sufficient intensity to maintain the population 
of level 3 substantially equal to the population of level 1. This is called 
saturation — see Problem 6. Now consider the rate of change of the population 
n 2 of level 2 owing to normal thermal relaxation processes. In terms of the 
indicated transition rates P, 

dn 2 /dt = — n 2 P(2 1 1) - n 2 P(2 1 3) + n 3 P(3 1 2)+njP(ll 2) . (54) 

In the steady state dn 2 /dt = 0, and by virtue of the saturation rf power we have 
n 3 = n 1; whence 


= P( 31 2) + P(1 1 2) 

P(2 1 1) + P(2 1 3) 

The transition rates are affected by many details of the paramagnetic ion 
and its environment, but one can hardly fail with this system, for either n 2 > n 1 
and we get maser action between levels 2 and 1, or n 2 < = n 3 and we get 

maser action between levels 3 and 2. The energy levels of the Er 3+ ion are 
used in communication fiber optics amplifiers. The ion is optically pumped 
from level 1 to level 3; there is fast nonradiative decay from level 3 to level 2. 
The signal at a wavelength of 1.55 ^m is amplified by stimulated emission 
from level 2 to level 1. The wavelength is favorable for long-distance propaga- 
tion in the optical fiber. The bandwidth is of the order of 4 X 10 12 Hz. 
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Figure 23 Energy level diagram of Cr 3+ in ruby, as used in laser operation. The initial excitation 
takes place to the broad bands; they decay to the intermediate levels by the emission of 
phonons, and the intermediate levels radiate photons as the ion makes the transition to the ground 
level. 


Lasers 

The same crystal, ruby, used in the microwave maser was also the first 
crystal to exhibit optical maser action, but a different set of energy levels of 
Cr 3+ are involved (Fig. 23). About 15,000 cm -1 above the ground state there 
lie a pair of states labeled 2 £, spaced 29 cm -1 apart. Above 2 E lie two broad 
bands of states, labeled 4 Fj and 4 F 2 . Because the bands are broad they can be 
populated efficiently by optical absorption from broadband light sources such 
as xenon flash lamps. 

In operation of a ruby laser both of the broad 4 F bands are populated by 
broadband light. Atoms thus excited will decay in 10“ ' sec by radiationless 
processes with the emission of phonons to the states 2 E. Photon emission from 
the lower of the states 2 E to the ground state occurs slowly, in about 5 X 10 -3 sec, 
so that a large excited population can pile up in 2 E. For laser action this popula- 
tion must exceed that in the ground state. 

The stored energy in ruby is 10 8 erg cm -3 if 10 2 ° Cr 3+ ions cm -3 are in an 
excited state. The ruby laser can emit at a very high power level if all this 
stored energy comes out in a short burst. The overall efficiency of conversion 
of a ruby laser from input electrical energy to output laser light is about one 
percent. Another popular solid state laser is the neodymium glass laser, made 
of calcium tungstate glass doped with Nd 3+ ions. This operates as a four level 
system (Fig. 24). Here it is not necessary to empty out the ground state before 
laser action can occur. 
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Figure 24 Four-level laser system, as in the neodymium 
glass laser. 
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SUMMARY 

(In CGS Units) 

• The resonance frequency of a free spin is « 0 = y B 0 , where y = fi/hl is the 
magnetogyric ratio. 

• The Bloch equations are 

dMJdt = y(M X B), - MJT 2 ■ 
dM tJ /dt = y(M X B) y — M tJ /T 2 ; 
dMJdt = y(M X B). + (M 0 - M.)/T x . 

• The half-width of the resonance at half-power is (Aoj) 1/2 = 1/T 2 . 

• The dipolar line width in a rigid lattice is (AB) 0 ~ ju./a 3 . 

• If the magnetic moments are ambulatory, with a characteristic time r <? 
l/(Aw) 0 , the line width is reduced by the factor (Aw) 0 t. In this limit 
1/Tj ~ 1/T 2 ** (Ata)o r. With exchange coupling in a paramagnet the line 
width becomes «(Aw) 2 0 w e!I . 

• The ferromagnetic resonance frequency in an ellipsoid of demagnetization 
factors N x , N y , N, is = -f[B 0 + (N y - N Z )M][B 0 + (. N x - N X )M]. 

• The antiferromagnetic resonance frequency is ta| = y 2 B A (B A + 2B E ), in a 
spherical specimen with zero applied field. Here B A is the anisotropy field 
and B e is the exchange field. 

• The condition for maser action is that n„ — n ; > VAB/8-7rp.(). 
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Problems 

1. Equivalent electrical circuit. Consider an empty coil of inductance L n in a series 
with a resistance R 0 ; show if the coil is completely filled with a spin system charac- 
terized by the susceptibility components x'M and x(a>) that the inductance at 
frequency <a becomes L = [1 + 4ttx’ M]L 0 , in series with an effective resistance 
R — 4ttcox" {a>)L 0 + R 0 . In this problem x = X' ix is defined for a linearly polar- 
ized rf field. Hint: Consider the impedance of the circuit. (CCS units.) 

2. Rotating coordinate system. We define the vector F (t) = F x (t)x + Fy{t) y + FJt)i. 
Let the coordinate system of the unit vectors x, y, z rotate with an instantaneous 
angular velocity ft, so that dx/dt = O^z — fl.y, etc. (a) Show that dV/dt = ( dE/dt ) H + 
Q X F, where {dE/dt) R is the time derivative of F as viewed in the rotating frame R. 
(b) Show that (7) may be written (dM/dt) B — yM X (B fl + il/y). This is the equation 
of motion of M in a rotating coordinate system. The transformation to a rotating 
system is extraordinarily useful; it is exploited widely in the literature, (c) Let 
n = — yB 0 z; thus in the rotating frame there is no static magnetic field. Still in the ro- 
tating frame, we now apply a dc pulse B x for a time t. If the magnetization is initially 
along z, find an expression for the pulse length t such that the magnetization will be 
directed along — z at the end of the pulse. (Neglect relaxation effects.) (d) Decribe 
this pulse as viewed from the laboratory frame of reference. 

3. Hyperfine effects on ESR in metals. We suppose that the electron spin of a con- 
duction electron in a metal sees an effective magnetic field from the hyperfine 
interaction of the electron spin with the nuclear spin. Let the z component of the 
field seen by the conduction electron be written 


*'-\i 


Vj 


where I? is equally likely to be ±f. (a) Show that (Bf/ = la/2NfN. (b) Show that 
<Bt> = 3(a/2N) 4 N\ for N>1. 

4. FMR in the anisotropy field. Consider a spherical specimen of a uniaxial ferro- 
magnetic crystal with an anisotropy energy density of the form U K = K sin 2 0, where 
9 is the angle between the magnetization and the z axis. We assume that K is posi- 
tive. Show that the ferromagnetic resonance frequency in an external magnetic field 
B 0 z is (o, | = y(B 0 + B A ), where B A = 2 KJM,. 

5. Exchange frequency resonance. Consider a ferrimagnet with two sublattices 
A and B of magnetizations M A and M B , where M B is opposite to M A when the spin 
system is at rest. The gyromagnetic ratios are y A , y B and the molecular fields are 
B , = — AM b ; B s = — AM a . Show that there is a resonance at 


“o = A 2 (y A | | - y B |M a |) 2 . 

This is called the exchange frequency resonance. 

6. Rf saturation. Given, at equilibrium for temperature i\ a two-level spin system in 
a magnetic field H 0 z, with populations N 1; N 2 and transition rates W )2 , W 2I . We 
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apply an rf signal that gives a transition rate VV' ri . (a) Derive the equation for dMJdt 
and show that in the steady state 

M z = M 0 /( 1 + 2 Wjl\) , 

where 1/T Y = W 12 + W 21 . It will be helpful to write N = \' t — N 2 ; n = N 1 — N%; and 
n 0 = N(W 21 - W 12 )/(W 21 + W 12 ). We see tliat as long as 2 W rf T 1 < 1 the absorption of 
energy from the rf field does not substantially alter the population distribution from 
its thermal equilibrium value, (h) Using the expression for n. write down the rate at 
which energy is absorbed from the rf field. What happens as W rf approaches 1/2T|? 
This effect is called saturation, and its onset may be used to measure Tp 
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Figure 1 Dielectric function e(<u) or e(cu, 0) of a free-electron gas versus frequency in units of the 
plasma frequency Electromagnetic waves propagate without damping only when e is positive 
and real. Electromagnetic waves are totally reflected from the medium when e is negative. 
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CHAPTER 14: PLASMONS, POLARITONS, AND POLARONS 


DIELECTRIC FUNCTION OF THE ELECTRON GAS 

The dielectric function e(o),K) of the electron gas, with its strong depen- 
dence on frequency and wavevector, has significant consequences for the 
physical properties of solids. In one limit, e(&>,0) describes the collective exci- 
tations of the Fermi sea — the volume and surface plasmons. In another limit, 
e(0,K) describes the electrostatic screening of the electron-electron, electron- 
lattice, and electron-impurity interactions in crystals. 

We will also use the dielectric function of an ionic crystal to derive the po- 
lariton spectrum. Later we discuss the properties of polarons. But first we are 
concerned with the electron gas in metals. 

Definitions of the Dielectric Function. The dielectric constant e of elec- 
trostatics is defined in terms of the electric field E and the polarization P, the 
dipole moment density: 

(CGS) D = E + 47rP = eE; (SI) D = e 0 E + P - ee 0 E . (1) 


Thus defined, e is also known as the relative permittivity. 

The introduction of the displacement D is motivated by the usefulness of 
this vector related to the external applied charge density p ext in the same way 
as E is related to the total charge density p = p ext + p ind , where p ind is the 
charge density induced in the system by p ext . 

Thus the divergence relation of the electric field is 

(CGS) (SI) 

div D = div eE = 47 rp exf div D = div ee 0 E = p ext (2) 

div E = 4t rp = 4 tt ( p ext + p ind ) div E = p/e„ = (p est + p ind )/e 0 . (3) 

Parts of this chapter will be written in CGS; to obtain results in SI, write l/e 0 
for 47 t . 


We need relations between the Fourier components of D, E, p, and the 
electrostatic potential <p. For brevity we do not exhibit here the frequency de- 
pendence. Define e(K) such that 

D(K) = e(K)E(K) ; (3a) 
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then (3) becomes 

div E = div 2 E(K) exp(iK • r) = 477" 2 p(K) exp(«K • r) , (3b) 

and (2) becomes 

div D = div 2 e(K)E(K) exp(iK'r) = 4-77 2 p ext (K) exp(iK>r) . (3c) 

Each of the equations must be satisfied term by term; we divide one by 
the other to obtain 


e(K) 


Pext(K) _ , Pind(K) 

p(K) p(K) 


(3d) 


The electrostatic potential <p ext defined by — V cp cxt = D satisfies the 
Poisson equation V 2 <p ext = —4rrp ext ; and the electrostatic potential cp defined 
by — V<p = E satisfies V 2 <p = —4irp. The Fourier components of the potentials 
must therefore satisfy 


^(K) 

<pi K) 


Pext(K) 

P(K) 


e(K) , 


(3e) 


by (3d). We use this relation in the treatment of the screened coulomb potential. 


■ 


Plasma Optics 

The long wavelength dielectric response e(«,0) or e(&>) of an electron gas 
is obtained from the equation of motion of a free electron in an electric field: 

rm|f = —eE . (4) 

If x and E have the time dependence e~ mt , then 

—(i) 2 mx = — eE ; x = eE/ma) 2 . (5) 

The dipole moment of one electron is ~ex = —e 2 E/ma> 2 , and the polarization, 
defined as the dipole moment per unit volume, is 

P = —nex = — E , (6) 

ma) 

where n is the electron concentration. 

The dielectric function at frequency a> is 


(CGS) 

III 

tq O 

ill 

+ 

tt*. 

4 

s, e. 

(7) 

(SI) 

, D(w) P(a>) 

e{W ’ eoE(co) eoE(<o) ' 
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The dielectric function of the free electron gas follows from (6) and (7): 

(CCS) e(«>) = 1 - ; (SI) e(to) = 1 - . (8) 

mi o €f/nw 

The plasma frequency w p is defined by the relation 


(CGS) w 2 - 4Tme 2 /m ; 


(SI) top = ne 2 !e 0 m . (9) 


A plasma is a medium with equal concentration of positive and negative 
charges, of which at least one charge type is mobile. In a solid the negative 
charges of the conduction electrons are balanced by an equal concentration of 
positive charge of the ion cores. We write the dielectric function (8) as 


e(to) = 1 - 



( 10 ) 


plotted in Fig. 1. 

If the positive ion core background has a dielectric constant labeled e(°°) 
essentially constant up to frequencies well above a) p , then (8) becomes 


e(to) = e(a>) — 4ime 2 /mto 2 = e(oo)[l — top/to 2 ] , (11) 


where to p is defined as 

to 2 = 47me 2 /e(°°)m . 

Notice that e = 0 at w = w p . 


( 12 ) 


Dispersion Relation for Electromagnetic Waves 

In a nonmagnetic isotropic medium the electromagnetic wave equation is 

(CGS) d 2 D/at 2 = c 2 V 2 E ; (SI) p () d 2 D/dt 2 = V 2 E . (13) 

We look for a solution with E exp(— twf) exp(iK • r) and D = e(to,K)E; then 

we have the dispersion relation for electromagnetic waves: 

(CGS) e(to,K)to 2 = c 2 K 2 ; (SI) e(<»,K)e 0 p 0 u> 2 = K 2 . (14) 

This relation tells us a great deal. Consider 

• e real and > 0. For a> real, K is real and a transverse electromagnetic wave 
propagates with the phase velocity c/e V2 . 

• e real and < 0. For w real, K is imaginary and the wave is damped with a 
characteristic length 1/|K|. 

• e complex. For w real, K is complex and the waves are damped in space. 
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• e — °° . This means the system has a finite response in the absence of an ap- 
plied force; thus the poles of e(w,K) define the frequencies of the free 
oscillations of the medium. 

• e = 0. We shall see that longitudinally polarized waves are possible only at 
the zeros of e. 

Transverse Optical Modes in a Plasma 

The dispersion relation (14) becomes, with (11) for e(w), 

(CGS) e(w)w 2 = e(oo)( w 2 - Z3p = c 2 fC 2 . (15) 

For 0 ) < w we have K 2 < 0, so that K is imaginary. The solutions of the wave 
equation are of the form exp( — iKlx) in the frequency region 0 < o» < <w p . 
Waves incident on the medium in this frequency region do not propagate, but 
will be totally reflected. 

An electron gas is transparent when a> > w p , for here the dielectric func- 
tion is positive real. The dispersion relation in this region may be written as 

(CGS) w 2 = + c 2 K*/e(°°) ; (16) 

this describes transverse electromagnetic waves in a plasma (Fig. 2). 

Values of the plasma frequency w p and of the free space wavelength k p = 
2n w/o) p for electron concentrations of interest are given below. A wave will prop- 
agate if its free space wavelength is less than k p ; otherwise the wave is reflected. 

n, electrons/cm 3 10 22 10 18 10 14 10 10 

t* p , s _1 5.7 X 10 15 5.7 X 10 13 5.7 X10 11 5.7 X 10 9 

A,„ cm 3.3 X 10 " 3 3.3 X 10“ 3 0.33 33 

Transparency of Metals in the Ultraviolet. From the preceding discussion 
of the dielectric function we conclude that simple metals should reflect light in 
the visible region and be transparent to light at high frequencies. A comparison 
of calculated and observed cutoff wavelengths is given in Table 1 . The reflection 
of light from a metal is entirely similar to the reflection of radio waves from the 
ionosphere, for the free electrons in the ionosphere make the dielectric con- 
stant negative at low frequencies. Experimental results for InSb with n = 4 X 
10 18 cm" 3 are shown in Fig. 3, where the plasma frequency is near 0.09 eV. 

Longitudinal Plasma Oscillations 

The zeros of the dielectric function determine the frequencies of the 
longitudinal modes of oscillation. That is, the condition 

e(«0 = 0 (17) 


determines the longitudinal frequency a>, near K = 0. 



14 Plasmom, Polaritons, and Polarons 


399 



Figure 2 Dispersion relation for transverse electromagnetic waves in a plasma. The group veloc- 
ity o g = dto/dK is the slope of the dispersion curve. Although the dielectric function is between 
zero and one, the group velocity is less than the velocity of light in vacuum. 


Table 1 Ultraviolet transmission limits of alkali metals, in A 



Li 

Na 

K 

Rb 

Cs 

A p , calculated 

1550 

2090 

2870 

3220 

3620 

Ap, observed 

1550 

2100 

3150 

3400 

— 



Figure 3 Reflectance of indium antimonide 
with n = 4 X 10 18 cm -3 . (After J. N. Hodgson.) 
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By the geometry of a longitudinal polarization wave there is a depolarization 
field E = -47 tP, discussed below. Thus D = E + 47 tP = 0 for a longitudinal 
wave in a plasma or more generally in a crystal. In SI units, D = e 0 E + P = 0. 
For an electron gas, at the zero (17) of the dielectric function (10) 

e((o L ) = 1 - w 2 /w£ = 0 , (18) 


whence w L = w p . Thus there is a free longitudinal oscillation mode (Fig. 4) 
of an electron gas at the plasma frequency described by (15) as the low- 
frequency cutoff of transverse electromagnetic waves. 

A longitudinal plasma oscillation with K = 0 is shown in Fig. 5 as a uni- 
form displacement of an electron gas in a thin metallic slab. The electron gas is 
moved as a whole with respect to the positive ion background. The displace- 
ment u of the electron gas creates an electric field E = 4vneu that acts as a 
restoring force on the gas. 

The equation of motion of a unit volume of the electron gas of concentra- 
tion n is 


(CGS) 


= —neE = —4im 2 e 2 u 

dr 


(19) 


or 

(CGS) ^ + w 2 u=0; = (20) 

This is the equation of motion of a simple harmonic oscillator of frequency w p , 
the plasma frequency. The expression for w p is identical with (9), which arose in 
a different connection. In SI, the displacement u creates the electric field 
E = neu/e 0 , whence co p — (ne 2 /e 0 m) m . 

A plasma oscillation of small wavevector has approximately the frequency 
w p . The wavevector dependence of the dispersion relation for longitudinal 
oscillations in a Fermi gas is given by 


w = w p (1 + 3/c 2 u|/l()Wp -I ) , 

where v F is the electron velocity at the Fermi energy. 


( 21 ) 




L i J l.l 


Figure 4 A plasma oscillation. The arrows indicate the direction of displacement of the electrons. 
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Figure 5 In (a) is shown a thin slab or film of a metal. A cross section is shown in (b), with 
the positive ion cores indicated by + signs and the electron sea indicated by the gray back- 
ground. The slab is electrically neutral. In (c) the negative charge has been displaced upward 
uniformly by a small distance u, shown exaggerated in the figure. As in (d), this displacement 
establishes a surface charge density — neu on the upper surface of the slab and +neu on the 
lower surface, where n is the electron concentration. An electric field E = 4 irneu is produced 
inside the slab. This field tends to restore the electron sea to its equilibrium position (b). In SI 
units, E = neu/e n . 


PLASMONS 

A plasma oscillation in a metal is a collective longitudinal excitation of 
the conduction electron gas. A plasmon is a quantum of a plasma oscillation; 
we may excite a plasmon by passing an electron through a thin metallic film 
(Figs. 6 and 7) or by reflecting an electron or a photon from a film. The charge 
of the electron couples with the electrostatic field fluctuations of the plasma 
oscillations. The reflected or transmitted electron will show an energy loss 
equal to integral multiples of the plasmon energy. 

Experimental excitation spectra for Al and Mg are shown in Fig. 8. A 
comparison of observed and calculated values of plasmon energies is given in 
Table 2; further data are given in the reviews by Raether and by Daniels. Recall 
that co p as defined by (12) includes the ion core effects by use of e(«>). 
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Incident electron 


1 

''^^Plasmon 


n 

Scattered electron 



Figure 6 Creation of a plasmon in a metal film by inelastic scattering of an electron. The incident 
electron typically has an energy 1 to 10 keV; the plasmon energy may be of the order of 10 eV. An 
event is also shown in which two plasmons are created. 


Figure 7 A spectrometer with electrostatic analyzer 
for the study of plasmon excitation by electrons. (After 
J. Daniels et al.) 



It is equally possible to excite collective plasma oscillations in dielec- 
tric films; results for several dielectrics are included. The calculated plasma 
energies of Si, Ge, and InSb are based on four valence electrons per atom. 
In a dielectric the plasma oscillation is physically the same as in a metal; the 
entire valence electron gas oscillates back and forth with respect to the ion 


cores. 



14 Planmons, Polaritons, and Polarons 


403 



Figure 8 Energy loss spectra for electrons reflected from films of (a) aluminum and (b) magne- 
sium, for primary electron energies of 2020 eV. The 12 loss peaks observed in A1 are made up of 
combinations of 10.3 and 15.3 eV losses, where the 10.3 eV loss is due to surface plasinons and the 
15.3 eV loss is due to volume plasnions. The ten loss peaks observed in Mg are made up of combi- 
nations of 7.1 eV surface plasmons and 10.6 cV volume plasinons. Surface plasmons are the sub- 
ject of Problem 1. (After C. J. Powell and J. B. Swan.) 


Table 2 Volume plasmon energies, in eV 


Material 

Observed 

Calculated 

h<j> v 

ha> r 

Metals 

Li 

7.12 

8.02 

7.96 

Na 

5.71 

5.95 

5.58 

K 

3.72 

4.29 

3.86 

Mg 

10.6 

10.9 


Al 

15.3 

15.8 


Dielectrics 

Si 

16.4-16.9 

16.0 


Ge 

16.0-16.4 

16.0 


InSb 

12.0-13.0 

12.0 



ELECTROSTATIC SCREENING 

The electric field of a positive charge embedded in an electron gas falls off 
with increasing r faster than 1/r, because the electron gas tends to gather 
around and thus to screen the positive charge. The static screening can be de- 
scribed by the wavevector dependence of the static dielectric function e(0,K). 
We consider the response of the electrons to an applied external electrostatic 
field. We start with a uniform gas of electrons of charge concentration ~n ( /> 
superimposed on a background of positive charge of concentration n t f. Let the 
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positive charge background be deformed mechanically to produce a sinusoidal 
variation of positive charge density in the x direction: 

p 1 (x) = ritf, + p ext (K) sin Kx . (22) 

The term p ext (K) sin Kx gives rise to an electrostatic field that we call the exter- 
nal field applied to the electron gas. 

The electrostatic potential cp of a charge distribution is found from the 
Poisson equation V 2 <p = — 4-np, by (3) with E = — V<p. For the positive 
charge we have 

(p = <p eM (K) sin Kx ; p = p en (K) sin Kx . (23) 

The Poisson equation gives the relation 

KVe*(K) = 4 7T Pe JK) . (24) 

The electron gas will be deformed by the combined influences of the elec- 
trostatic potential (p^K ) of the positive charge distribution and of the as yet 
unknown induced electrostatic potential ip ind (K) sin Kx of the deformation of 
the electron gas itself. The electron charge density is 

p~(x) = -nge + PtadfK) sin Kx , (25) 

where p im i(K) is the amplitude of the charge density variation induced in the 
electron gas. We want to find p ini1 (f£) in terms of p ext (K). 

The amplitude of the total electrostatic potential <p(K) — (p ext (K ) + (p im} (K) of 
the positive and negative charge distributions is related to the total charge density 
variation p(K) — p ext (K ) + p ind (K) by the Poisson equation. Then, as in Eq. (24), 

K 2 <p(K) = 4 tt P (K) . (26) 

To go further we need another equation that relates the electron concen- 
tration to the electrostatic potential. We develop this connection in what is 
called the Thomas-Fermi approximation. The approximation consists in assum- 
ing that a local internal chemical potential can be defined as a function of the 
electron concentration at that point. Now the total chemical potential of the 
electron gas is constant in equilibrium, independent of position. In a region 
where there is no electrostatic contribution to the chemical potential we have 

M = e “’ = £ (3l73no)2/3 (27) 

at absolute zero, according to (6.17). In a region where the electrostatic poten- 
tial is <p(x) 7 the total chemical potential (Fig. 9) is constant and equal to 

p. = e F (x) - e<p(x ) = |^[37 T l n(x)f ri - e<p(x) = J^[377 \>] 2 ' 3 , (28) 

where e F (x) is the local value of the Fermi energy. 

The expression (28) is valid for static electrostatic potentials that vary 
slowly compared with the wavelength of an electron at the Fermi level; 
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Chemical potential 



Region of normal 
electron concentration 


-e<p(r) 


Region of enhanced 
electron concentration 


Figure 9 In thermal and diffusive equilibrium the chemical potential is constant; to maintain it 
constant we increase the electron concentration in regions of space where the potential energy is 
low, and we decrease the concentration where the potential is high. 


specifically, the approximation is q < k F . By a Taylor series 
Eq. (28) may be written as 

^■E n(x) ~ rc n ] = ecp(x) . 
ariQ 

expansion of e F , 

(29) 

From (27) we have de F /dn 0 — 2e f /3n 0 , whence 


_ 3 e<p{x) 

n(x) n 0 - 2 n 0 €f . 

(30) 

The left-hand side is the induced part of the electron concentration; thus 

the Fourier components of this equation are 


p md (K) = -(3n 0 e 2 /2e F )cp(K) . 

(31) 

By (26) this becomes 


Pmd(K) = —(67 m 0 e 2 /e F K 2 )p(K) . 

(32) 

By (3d) we have 


e(0,K) = 1 - 7 (kT = 1 + ; 

(33) 

here, after some rearrangement. 


k 2 = Simpler — 4 (3/ir) 1/3 nJ /3 /a 0 = 4it e 2 D(e F ) , 

(34) 


where a 0 is the Bohr radius and D(e F ) is the density of states for a free electron 
gas. The approximation (33) for e(0,K) is called the Thomas-Fermi dielectric 
function, and 1/fc, is the Thomas-Fermi screening length, as in (40) below. For 
copper with n 0 — 8.5 X 10 22 cm* 3 , the screening length is 0.55 A. 

We have derived two limiting expressions for the dielectric function of an 
electron gas: 

k 2 a>l 

e(0,K) = 1 + ; e(o»,0) = 1 - ■ (35) 

K o> 

We notice that e(0 ,K) as K — t 0 does not approach the same limit as e(<w,0) as 
(o — > 0. This means that great care must be taken with the dielectric function 
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near the origin of the a>-K plane. The full theory for the general function 
e{oj,K) is due to Lindhard.' 


Screened Coulomb Potential. We consider a point charge q placed in a sea 
of conduction electrons. The Poisson equation for the unscreened coulomb 
potential is 

V 1 2 <po ~ — 4.Trq8(r) , (36) 

and we know that <p 0 = q/r. Let us write 

<Po( r) = (27 rr 3 / dK <p 0 (K) exp(iK • r) . (37) 


We use in (36) the Fourier representation of the delta function: 

S( r) = (2i t ) -3 / dK exp(tK • r) , (38) 

whence K 2 (p 0 (K ) = ATvq. 

By (3e), 

<p 0 (K)/<p(K) = e(K) , 


where cp(K) is the total or screened potential. We use e(K) in the Thomas- 
Fermi form (33) to find 

4mi 

« K> = WTky {m 

The screened coulomb potential is the transform of <p(K): 

4m f” 9ttK 2 f 

<p{r) = ——5 dK — d ( cos d) exp (iKr cos 6) 

(27r) J Jo K? + k 2 J - 1 

_ % f ” K sin Kr _ C 1 , . , , An s 

jrr J o dK ^2 ^.2 r ex P( ^ s r ) (40) 

as in Fig. 10a. The screening parameter k s is defined by (34). The exponential 
factor reduces the range of the coulomb potential. The bare potential q/r is 
obtained on letting the charge concentration n c — » 0, for then k s — t 0. In the 
vacuum limit (p(K ) — 4irq/K 2 . 

One application of the screened interaction is to the resistivity of certain 
alloys. The atoms of the series Cu, Zn. Ga, Ge, As have valences 1, 2, 3, 4, 5. 
An atom of Zn, Ga, Ge, or As added substitutionally to metallic Gu has an ex- 
cess charge, referred to Cu, of 1, 2, 3, or 4 if all the valence electrons join the 
conduction band of the host metal. The foreign atom scatters the conduction 
electrons, with an interaction given by the screened coulomb potential. This 
scattering contributes to the residua! electrical resistivity, and calculations by 
Mott of the resistivity increase are in fair agreement with experiment. 


1 A good discussion of the Lindhard dielectric function is given by J Ziman, Principles of the 

theory of solids, 2nd ed., Cambridge, 1072, Chapter 5. The algebraic steps in the evaluation of 

Ziman’s equation (5.16) are given in detail by C. Kittel, Solid state physics 22, 1 (1968), Section 6. 
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Figure 10a Comparison of screened and unscreened 
coulomb potentials of a static unit positive charge. The 
screening length 1 /k s is set equal to unity. The static 
screened interaction is taken in the Thomas-Fermi ap- 
proximation, which holds for low wavevectors q < k f - 
More complete calculations with all wavevectors in- 
cluded exhibit spatial oscillations, called Friedel 
oscillations, in 2 k F r and are plotted in QTS, p. 114. 





Pseudopotential Component U(0). In the legend to Fig. 9.22b we stated a 
result that is important in pseudopotential theory: “For very small k the poten- 
tial approaches — § times the Fermi energy.” The result, which is known as the 
screened ion limit for metals, can be derived from Eq. (39). When converted to 
the potential energy of an electron of charge e in a metal of valency z with n 0 
ions per unit volume, the potential energy component at k = 0 becomes 

[7(0) = —ezn 0 (p(0) - —Anzu^/k 2 . (41) 

The result (34) for k 2 s in this situation reads 

k 2 = 6irzn^e 2 /e F , (42) 

whence 

U(0) = -|e F . (43) 

Mott Metal-Insulator Transition 

A crystal composed of one hydrogen atom per primitive cell should always 
be a metal, according to the independent-electron model, because there will al- 
ways be a half-filled energy band within which charge transport can take place. 
A crystal with one hydrogen molecule per primitive cell is a different matter, 
because the two electrons can fill a band. Under extreme high pressure, as in 
the planet Jupiter, it is possible that hydrogen occurs in a metallic form. 

But let us imagine a lattice of hydrogen atoms at absolute zero: will this be 
a metal or an insulator? The answer depends on the lattice constant, with small 
values of a giving a metal and large values giving an insulator. Mott made an 
early estimate of the critical value a c of the lattice constant that separates the 
metallic state from the insulating state: a c = 4.5 a 0 , where a 0 = h 2 /me 2 is the ra- 
dius of the first Bohr orbit of a hydrogen atom. 

On one approach to the problem, we start in the metallic state where a 
conduction electron sees a screened coulomb interaction from each proton: 

U(r) = — (e 2 /r) exp(— k s r) , (44) 
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Figure 10b Scmilog plot of observed “zero tempera- 
ture” conductivity rr(0) versus donor concentration n for 
phosphorous donors in silicon. (After T. F. Rosenbaum, 
ctal.) 



where kf = 3.939 4 /3 /a u , as in (34), where n 0 is the electron concentration. At 
high concentrations k s is large and the potential has no hound state, so that we 
must have a metal. 

The potential is known to have a bound state when k s is smaller than 
1.19/a 0 . With a bound state possible the electrons may condense about the 
protons to form an insulator. The inequality may be written in terms of n 0 as 

3.9394%. < 1.42/4 • (45) 

With n 0 — 1/a 3 for a simple cubic lattice, wc may have an insulator when a r > 
2.78 a 0 , which is not far from the Mott result 4.5 i 2 0 found in a different way. 

The term metal-insulator transition has come to denote situations where 
the electrical conductivity of a material changes from metal to insulator as a 
function of some external parameter, which may be composition, pressure, 
strain, or magnetic field. The metallic phase may usually be pictured in terms 
of an independent-electron model; the insulator phase may suggest important 
electron-electron interactions. Sites randomly occupied introduce new and in- 
teresting aspects to the problem, aspects that lie within percolation theory. 
The percolation transition is beyond the scope of our book. 

When a semiconductor is doped with increasing concentrations of donor 
(or acceptor) atoms, a transition will occur to a conducting metallic phase. 
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Experimental results for P atoms in silicon are shown in Fig. 10b. Here the 
insulator-metal transition takes place when the concentration is so high that 
the ground state wavefunctions of electrons on neighboring impurity atoms 
overlap significantly. 

The observed value of the critical concentration in the Si : P alloy system 
is n c = 3.74 X 10 18 cm 3 , as in the figure. If we take 32 X 10” s cm as the 
radius of the ground state of a donor in Si in the spherical approximation, then 
by the Mott criterion a c — 1.44 X 10" 6 cm. The P atoms are believed to oc- 
cupy lattice sites at random, but if instead their lattice were simple cubic, the 
critical Mott concentration would be 

n c = 1/a 3 = 0.33 X 10 ls cm" 3 , (46) 

appreciably less than the observed value. It is usual in the semiconductor liter- 
ature to refer to a heavily-doped semiconductor in the metallic range as a 

degenerate semiconductor. 


Screening and Phonons in Metals 

An interesting application of our two limiting forms of the dielectric func- 
tion is to longitudinal acoustic phonons in metals. For longitudinal modes the 
total dielectric function, ions plus electrons, must be zero, by (17). Provided 
the sound velocity is less than the F ermi velocity of the electrons, we may use 
for the electrons the Thomas-Fermi dielectric function 


e d (a),K) = 1 + kf/K 2 . (47) 

Provided also that the ions are well-spaced and move independently, we may 
use for them the plasm on e(<w,0) limit with the approximate mass M. 

The total dielectric function, lattice plus electrons, but without the elec- 
tronic polarizability of the ion cores, is 


e(w,K) =1- 


4ime 2 fcf 
Mo/ K 2 ' 


(48) 


At low K and o> wc neglect the term 1. At a zero of e(a>,K) we have, with 
_ 1 0 
€ F — 2 m%, 


to 


2 


Avne 2 r* _ 4 rrne 2 
Mk 2 M 


e F ,S2 _ 
67me 2 


m 2 

3M Vfr 


K 


(49) 


or 

<i) = vK ; v = (m/3M) 1/2 u F . (50) 

This describes long wavelength longitudinal acoustic phonons. 

In the alkali metals the result is in quite good agreement with the 
observed longitudinal wave velocity. For potassium we calculate v = 1.8 X 
10 5 cm s“ L ; the observed longitudinal sound velocity at 4 K in the [100] direc- 
tion is 2.2 X lO 3 cm s"'. 
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There is another zero of e(w,iC) for positive ions imbedded in an electron 
sea. For high frequencies we use the dielectric contribution — at^/w 2 of the 
electron gas: 


e(tu,0) — 1 


47 me 2 _ 4mie 2 

Mw 2 mor 


(51) 


and this function has a zero when 


2 _ 47me 2 _1 = _1_, _1_ 

W ; M M m ' 


(52) 


This is the electron plasma frequency (20), but with the reduced mass correc- 
tion for the motion of the positive ions. 


POLARITONS 

Longitudinal optical phonons and transverse optical phonons were dis- 
cussed in Chapter 4, but we deferred treatment of the interaction of trans- 
verse optical phonons with transverse electromagnetic waves. When the two 
waves are at resonance the phonon-photon coupling entirely changes the char- 
acter of the propagation, and a forbidden band is established for reasons that 
have nothing to do with the periodicity of the lattice. 

By resonance we mean a condition in which the frequencies and wavevectors 
of both waves are approximately equal. The region of the crossover of the two 
dashed curves in Fig. 11 is the resonance region; the two dashed curves are the 
dispersion relations for photons and transverse optical phonons in the absence of 
any coupling between tbem. In reality', however, there always is coupling implicit 
in Maxwell’s equations and expressed by the dielectric function. The quantum of 
the coupled phonon-photon transverse wave field is called a polariton. 

In this section we see how the coupling is responsible for the dispersion 
relations shown as solid curves in the figure. All takes place at very low values 
of the wavevector in comparison with a zone boundary, because at crossover 
w(photon) = ck(photon) — w(phonon) 10 13 s _1 ; thus A; = 300 cnT 1 . 

An early warning: although the symbol to L will necessarily arise in the the- 
ory, the effects do not concern longitudinal optical phonons. Longitudinal 
phonons do not couple to transverse photons in the bulk of a crystal. 

The coupling of the electric field E of the photon with the dielectric polar- 
ization P of the TO phonon is described by the electromagnetic wave equation: 
(CGS) c 2 K?E = co 2 (E + 4t rP) . (53) 

At low wavevectors the TO phonon frequency w T is independent of K. The po- 
larization is proportional to the displacement of the positive ions relative to 
the negative ions, so that the equation of motion of the polarization is like that 
of an oscillator and may be written as, with P — Nqu, 

-orP + <4 P = (Nq-/M)E , 


(54) 
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Figure 11 A plot of the observed energies and wavevectors of the polaritons and of the LO 
phonons in GaP. The theoretical dispersion curves are shown by the solid lines. The dispersion 
curves for the uncoupled phonons and photons are shown by the short, dashed lines. (After 
C. H. Henry and J. J. Hopfield.) 


where there are N ion pairs of effective charge q and reduced mass M, per unit 
volume. For simplicity we neglect the electronic contribution to the polarization. 
The equations (53) and (54) have a solution when 


<n 2 - c 2 K 2 4 iro) 2 
Nq 2 /M or - <4 


(55) 


This gives the polariton dispersion relation, similar to that plotted in Figs. 11 
and 12. At K — 0 there are two roots, o> = 0 for the photon and 

w 2 = &)| + 4 t rNq z /M (56) 


for the polariton. Here (o T is the TO phonon frequency in the absence of 
coupling with photons. 

The dielectric function obtained from (54) is: 


4ttNu 2 /M 

e(<w) = 1 + 4 t tF/E = 1 + , ^ , . (57) 

0)j ft) 

If there is an optical electronic contribution to the polarization from the ion 
cores, this should be included. In the frequency range from zero up through 
the infrared, we write 


e(<w) = e(=°) 


+ 


4rNq~/M 


2 2 

0) T — Cl) 


(58) 
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Figure 12 Coupled modes of photons and transverse optical phonons in an ionic crystal. The fme 
horizontal line represents oscillators of frequen cy <a T in the absence of coupling to the electromag- 
netic field, and the fine line labeled (o = cK/Ve(“) corresponds to electromagnetic waves in the 
crystal, but uncoupled to the lattice oscillators <o T . The heavy lines are the dispersion relations in 
the presence of coupling between the lattice oscillators and the electromagnetic wave. One effect 
of the coupling is to create the frequency gap between a>, and a> T : within this gap the wavevector is 
pure imaginary of magnitude given by the broken line in the figure. In the gap the wave attenuates 
as exp( — |K|r), and we see from the plot that the attenuation is much stronger near <u T than near 
to L . The character of the branches varies with K: there is a region of mixed electric-mechanical as- 
pects near the nominal crossover. Note, finally, it is intuitively obvious that the group velocity of 
light in the medium is always <c, because the slope i W5K for the actual dispersion relations 
(heavy lines) is everywhere less than the slope c for the uncoupled photon in free space. 


in accord with the definition of e(°°) as the optical dielectric constant, ob- 
tained as the square of the optical refractive index. 

We set <o = 0 to obtain the static dielectric function: 

e{0) - e(<») + 4 irNcp/Mul , (59) 

which is combined with (58) to obtain e(< u) in terms of accessible parameters: 

e(w) = e(°°) + [e(0) - e(°°)] ^ 

co j - or 


or 


wfe(O) - o> 2 e(°°) „ , 

£(<w) r s = fT(°°) 


■ 


2 2 

(0 L — (x) 

2 2 

(x)j — CO 


(60) 


The zero of e(co) defines the longitudinal optical phonon frequency w L , as the 
pole of e(w) defines w T . The zero gives 

e(“)w| = e(0)w| . 


( 61 ) 
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(a) 


Figure 13a Plot of e(<o) from (60) for e(°°) = 2 and e(0) = 3. The dielectric constant is negative 
between (O = <o T and a> L — (3/2) 1/2 <Wj-; that is, between the pole (infinity) of e(<u) and the zero of 
f(aj). Incident electromagnetic waves of frequencies in the shaded regions ui T < u < w L will not 
propagate in the medium, but will be reflected at the boundary. 
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Figure 13b Dielectric function (real part) of SrF 2 measured over a wide frequency range, 
exhibiting the decrease of the ionic polarizability at high frequencies. (A. von Hippel.) 
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t* t* 



TO phonon LO phonon 


Figure 14 Relative displacements of the positive and negative ions at one instant of time for a 
wave in an optical mode traveling along the z axis. The planes of nodes (zero displacement) are 
shown; for long wavelength phonons the nodal planes are separated by many planes of atoms. In 
the transverse optical phonon mode the particle displacement is perpendicular to the wavevector 
K; the macroscopic electric field in an infinite medium will lie only in the ± x direction for the 
mode shown, and by the symmetry of the problem SE x /dx = 0. It follows that div E = 0 for a TO 
phonon. In the longitudinal optical phonon mode the particle displacements and hence the dielec- 
tric polarization P are parallel to the wavevector. The macroscopic electric field E satisfies D = 
E + 4irP = 0 in CGS or e 0 E + P = 0 in SI; by symmetry E and P are parallel to the z axis, and 
dEJdz # 0. Thus div E J= 0 for an LO phonon, and e(o>) div E is zero only if e(io) = 0. 


Waves do not propagate in the frequency region for which e((o) is nega- 
tive, between its pole at w = w T and its zero at co = cj l , as in Fig. 13. For nega- 
tive e, waves do not propagate because then K is imaginary for real to, and 
exp (iKx) — » exp( — |K|x), damped in space. The zero of e(to), by our earlier argu- 
ment, is the LO frequency at low K, Fig. 14. Just as with the plasma frequency 
to p , the frequency co L has two meanings, one as the LO frequency at low K and 
the other as the upper cutoff frequency of the forbidden band for propagation 
of an electromagnetic wave. The value of w L is identical at both frequencies. 


LST Relation 

We write (61) as 

o*l _ e (0) 
cof e(°°) 


(62) 


where e(0) is the static dielectric constant and e(<») is the high-frequency limit 
of the dielectric function, defined to include the core electron contribution. 
This result is the Lyddane-Sachs-Teller relation. The derivation assumed a 
cubic crystal with two atoms per primitive cell. For soft modes with co T — > 0 we 
see that e(0) -4 °o, a characteristic of ferroelectricity. 

Undamped electromagnetic waves with frequencies within the gap cannot 
propagate in a thick crystal. The reflectivity of a crystal surface is expected to 
be high in this frequency region, as in Fig. 15. 
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Figure 15 Reflectance of a crystal of NaCl at several temperatures, versus wavelength. The nom- 
inal values of u> L and co T at room temperature correspond to wavelengths of 38 and 61 X 10 - ' 1 cm, 
respectively. (After A. Mitsuishi et al.) 


Wave number in cm 1 



Figure 16 Reflectance versus wavelength of a LiF film backed by silver, for radiation incident 
near 30 u . The longitudinal optical phonon absorbs strongly the radiation polarized ( p ) in the plane 
normal to the film, but absorbs hardly at all the radiation polarized (s) parallel to the film. (After 
D. W. Berreman.) 
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For films of thickness less than a wavelength the situation is changed. Be- 
cause for frequencies in the gap the wave attenuates as exp( — |K|x), it is possi- 
ble for the radiation to be transmitted through a film for the small values of |K| 
near <a L , but for the large values of |K| near w T the wave wall be reflected. By 
reflection at nonnormal incidence the frequency co L of longitudinal optical 
phonons can be observed, as in Fig. 16. 

Experimental values of e(Q), e(°°), and w T are given in Table 3, with 
values of co L calculated using the LST relation, Eq. (62). We compare values of 


Table 3 Lattice parameters, chiefly at 300 K 


Crystal 

Static 

dielectric 

constant 

e(0) 

Optical 

dielectric 

constant 

«K“) 

to T , inl0 13 s 1 
experimental 

c o L> in 10 13 s _1 
LST relation 

LiH 

12.9 

3.6 

n. 

21. 

LiF 

8.9 

1.9 

5.8 

12. 

LiCl 

12.0 

2.7 

3.6 

7.5 

LiBr 

13.2 

3.2 

3.0 

6.1 

NaF 

5.1 

1.7 

4.5 

7.8 

NaCl 

5.9 

2.25 

3.1 

5.0 

NaBr 

6.4 

2.6 

2.5 

3.9 

KF 

5.5 

1.5 

3.6 

6.1 

KCl 

4.85 

2.1 

2.7 

4.0 

KI 

5.1 

2.7 

1.9 

2.6 

RbF 

65 

1.9 

2.9 

5.4 

Rbl 

5.5 

2.6 

1.4 

1.9 

CsCl 

7.2 

2.6 

1.9 

3.1 

Csl 

5.65 

3.0 

1.2 

1.6 

T1C1 

31.9 

5.1 

1.2 

3.0 

TlBr 

29,8 

5.4 

0.81 

1.9 

AgCl 

12,3 

4.0 

1.9 

3.4 

AgBr 

13.1 

4.6 

1.5 

2.5 

MgO 

9.8 

2.95 

7.5 

14. 

GaP 

10.7 

8.5 

6.9 

7.6 

GaAs 

13.13 

10.9 

5.1 

5.5 

GaSb 

15.69 

14.4 

4.3 

4.6 

InP 

12.37 

9.6 

5.7 

6.5 

InAs 

14.55 

12.3 

4.1 

4.5 

InSb 

17.88 

15.6 

3.5 

3.7 

SiC 

9.6 

6.7 

14.9 

17.9 

C 

5.5 

5.5 

25.1 

25.1 

Si 

11.7 

11.7 

9.9 

9.9 

Ge 

15.8 

15.8 

5.7 

5.7 
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w L /u) T obtained by inelastic neutron scattering with experimental values of 
[e(0)/e(oo) 

GaAs 

1.07 ± 0.02 

1.08 


)] 1/2 obtained by dielectric measurements: 


Nal 

KBr 

W L /(0 T 

1.44 ± 0.05 

1.39 ± 0.02 

[e( 0 )/e(oo)r 

1.45 ± 0.03 

1.38 ± 0.03 


The agreement with the LST relation is excellent. 


ELECTRON-ELECTRON INTERACTION 


Fermi Liquid 

Because of the interaction of the conduction electrons with each other 
through their electrostatic interaction, the electrons suffer collisions. Further, 
a moving electron causes an inertial reaction in the surrounding electron gas, 
thereby increasing the effective mass of the electron. The effects of electron- 
electron interactions are usually described within the framework of the 
Landau theory of a Fermi liquid. The object of the theory is to give a unified 
account of the effect of interactions. A Fermi gas is a system of noninteracting 
fermions; the same system with interactions is a Fermi liquid. 

Landau’s theory gives a good account of the low-lying single particle exci- 
tations of the system of interacting electrons. These single particle excitations 
are called quasiparticles; they have a one-to-one correspondence with the 
single particle excitations of the free-electron gas. A quasiparticle may be 
thought of as a single particle accompanied by a distortion cloud in the elec- 
tron gas. One effect of the coulomb interactions between electrons is to 
change the effective mass of the electron; in the alkali metals the increase is 
roughly of the order of 25 percent. 

Electron-Electron Collisions. It is an astonishing property of metals that 
conduction electrons, although crowded together only 2 A apart, travel long 
distances between collisions with each other. The mean free paths for 
electron-electron collisions are longer than 10 4 A at room temperature and 
longer than 10 cm at 1 K. 

Two factors are responsible for these long mean free paths, without which 
the free-electron model of metals would have little value. The most powerful 
factor is the exclusion principle (Fig. 17), and the second factor is the screen- 
ing of the coulomb interaction between two electrons. 

We show how the exclusion principle reduces the collision frequency of an 
electron that has a low excitation energy e, outside a filled Fermi sphere 
(Fig. 18). We estimate the effect of the exclusion principle on the two-body 
collision 1 + 2 — » 3 + 4 between an electron in the excited orbital 1 and an 
electron in the filled orbital 2 in the Fermi sea. It is convenient to refer all 
energies to the Fermi level jx taken as the zero of energy; thus will be 
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Figure 17 A collision between two electrons 
of wavevectors kj and k 2 . After the collision 
the particles have wavevectors k 3 and k 4 . The 
Pauli exclusion principle allows collisions only 
to final states k 3> k 4 which were vacant before 
the collision. 




(a) 



(b) 



Figure 18 In (a) the electrons in initial orbitals 1 and 2 collide. If the orbitals 3 and 4 are initially 
vacant, the electrons 1 and 2 can occupy orbitals 3 and 4 after the collision. Energy and momentum 
are conserved. In (b) the electrons in initial orbitals 1 and 2 have no vacant final orbitals available 
that allow energy to be conserved in the collision. Orbitals such as 3 and 4 would conserve energy 
and momentum, but they are already filled with other electrons. In (c) we have denoted with X the 
wavevector of the center of mass of 1 and 2. All pairs of orbitals 3 and 4 conserve momentum and 
energy if they lie at opposite ends of a diameter of the small sphere. The small sphere was drawn 
from the center of mass to pass through 1 and 2. But not all pairs of points 3, 4 are allowed by the 
exclusion principle, for both 3, 4 must lie outside the Fermi sphere; the fraction allowed is ~ei/e F . 
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positive and e 2 will be negative. Because of the exclusion principle the orbitals 
3 and 4 of the electrons after collision must lie outside the Fermi sphere, all 
orbitals within the sphere being already occupied; thus both energies e 3 , e 4 
must he positive referred to zero on the Fermi sphere. 

The conservation of energy requires that k 2 | < e,, for otherwise e 3 + e 4 = 
e 4 + e 2 could not be positive. This means that collisions are possible only if the 
orbital 2 lies within a shell of thickness within the Fermi surface, as in Fig. 
18a. Thus the fraction of the electrons in filled orbitals provides a suit- 

able target for electron 1. But even if the target electron 2 is in the suitable 
energy shell, only a small fraction of the final orbitals compatible with conser- 
vation of energy and momentum are allowed by the exclusion principle. This 
gives a second factor of e ; /e F . 

In Fig. 18c we show a small sphere on which all pairs of orbitals 3, 4 
at opposite ends of a diameter satisfy the conservation laws, but collisions can 
occur only if both orbitals 3, 4 lie outside the Fermi sea. The product of the 
two fractions is (e^ep) 2 . If Cj corresponds to 1 K and e F to 5 X 10 4 K, wc have 
(e^ep) 2 — 4 X 1CT 10 , the factor by which the exclusion principle reduces the 
collision rate. 

The argument is not changed for a thermal distribution of electrons at 
a low temperature such that k B T e F . Wc replace by the thermal energy 
x k B T, and now the rate at which electron-electron collisions take place is re- 
duced below the classical value by ( k B T/e t ) 2 y so that the effective collision 
cross section <r is 

u ~ (k B r/e F ) 2 o- 0 , (63) 

where (r 0 is the cross section for the electron-electron interaction. 

The interaction of one electron with another has a range of the order of 
the screening length 1 /k s as in (34). Numerical calculations give the effective 
cross section with screening for collisions between electrons as of the order of 
l(T l ° cm 2 or 10 A 2 in typical metals. The effect of the electron-gas background 
in electron-electron collisions is to reduce (7 0 below the value expected from 
the Rutherford scattering equation for the unscreened coulomb potential. 
However, much the greater reduction in the cross section is caused by the 
Pauli factor ( k e T/e F ) z . 

At room temperature in a typical metal k B T/e F is — lO -2 , so that 
(7 ~ 10~V n —TO - 18 cm 2 . The mean free path for electron-electron collisions is 
t ~ 1/ncr — 1 0 1 cm at room temperature. This is longer than the mean free 
path due to electron-phonon collisions by at least a factor of 10, so that at 
room temperature collisions with phonons are likely to be dominant. At liquid 
helium temperatures a contribution proportional to T 2 has been found in the 
resistivity of a number of metals, consistent with the form of the electron- 
electron scattering cross section (63). The mean free path of electrons in in- 
dium at 2 K is of the order of 30 cm, as expected from (63). Thus the Pauli 



principle explains one of the central questions of the theory of metals: how do 
the electrons travel long distances without colliding with each other? 


ELECTRON-PHONON INTERACTION: POLARONS 

The most common effect of the electron-phonon interaction is seen in the 
temperature dependence of the electrical resistivity, which for pure copper is 
1.55 microhm-cm at 0°C and 2.28 microhm-cm at 100°C. The electrons are 
scattered by the phonons, and the higher the temperature, the more phonons 
there are and hence more scattering. Above the Debye temperature the num- 
ber of thermal phonons is roughly proportional to the absolute temperature, 
and we find that the resistivity increases approximately as the absolute tem- 
perature in any reasonably pure metal in this temperature region. 

A more subtle effect of the electron-phonon interaction is the apparent in- 
crease in electron mass that occurs because the electron drags the heavy ion 
cores along with it. In an insulator the combination of the electron and its 
strain field is known as a polaron, Fig. 19. The effect is large in ionic crystals 
because of the strong coulomb interaction between ions and electrons. In co- 
valent crystals the effect is weak because neutral atoms have only a weak inter- 
action with electrons. 

The strength of the elcctron-latticc interaction is measured by the dimen- 
sionless coupling constant a given by 


2 deformation energy 

2 a= h^ L 


(64) 


where a) L is the longitudinal optical phonon frequency near zero wavevector. 
We view |a as “the number of phonons which surround a slow-moving elec- 
tron in a crystal.” 

Values of a deduced from diverse experiments and theoiy are given in 
Table 4, after F. C. Brown. The values of a are high in ionic crystals and low in 
covalent crystals. The values of the effective mass m* pol of the polaron are from 
cyclotron resonance experiments. The values given for the band effective mass 
m* were calculated from m* ol . The last row in the table gives the factor m' pa \/m* 
by which the band mass is increased by the deformation of the lattice. 

Theory relates the effective mass of the polaron m* 0 ] to the effective band 
mass rn of the electron in the undeformed lattice by the relation 


m i*»i = m* 


( 1 - 0.0008a 2 \ . 

\1 — |a + 0.0034a 2 / ’ 


(65) 


for a 1 this is approximately m*(l + |a). Because the coupling constant a is 
always positive, the polaron mass is greater than the bare mass, as we expect 
from the inertia of the ions. 
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Figure 19 The formation of a polaron. (a) A conduction electron is shown in a rigid lattice of 
an ionic crystal, KC1. The forces on the ions adjacent to the electron are shown, (b) The electron is 
shown in an elastic or deformable lattice. The electron plus the associated strain field is called a 
polaron. The displacement of the ions increases the effective inertia and, hence, the effective 
mass of the electron; in KCl the mass is increased by a factor of 2.5 with respect to the band theory 
mass in a rigid lattice. In extreme situations, often with holes, the particle can become self- 
trapped (localized) in the lattice. In covalent crystals the forces on the atoms from the electron are 
weaker than in ionic crystals, so that polaron deformations are small in covalent crystals. 


It is common to speak of large and small polarons. The electron associated 
with a large polaron moves in a band, but the mass is slightly enhanced; these 
are the polarons we have discussed above. The electron associated with a small 
polaron spends most of its time trapped on a single ion. At high temperatures 
the electron moves from site to site by thermally activated hopping; at low 
temperatures the electron tunnels slowly through the crystal, as if in a band of 
large effective mass. 




422 


Table 4 Polaron coupling constants a , masses trip o! , and band masses m* for 
electrons in the conduction band 


Crystal 

KC1 

KBr 

AgCI 

AgBr 

ZnO 

PbS 

InSb 

GaAs 

a 

3.97 

3.52 

2.00 

1.69 

0.85 

0.16 

0.014 

0.06 

m tm 

1.25 

0.93 

0.51 

0.33 

— 

— 

0.014 

■ 

0.50 

0.43 

0.35 

0.24 

— 

— 

0.014 

— 


2.5 

2.2 

1.5 

1.4 

— 

— 

1.0 

— 


Holes or electrons can become self-trapped by inducing an asymmetric 
local deformation of the lattice. This is most likely to occur when the band 
edge is degenerate and the crystal is polar (such as an alkali halide or silver 
halide), with strong coupling of the particle to the lattice. The valence band 
edge is more often degenerate than the conduction band edge, so that holes 
are more likely to be self-trapped than are electrons. Holes appear to be self- 
trapped in all the alkali and silver halides. 

Ionic solids at room temperature generally have very low conductivities 
for the motion of ions through the crystal, less than 10 6 (ohm -(mi) \ but a 
family of compounds has been reported with conductivities of 0.2 (ohm-cm) _1 
at 20°C. The compounds have the composition MAg 4 l 5 , where M denotes K, 
Rb, or NH 4 . The Ag + ions occupy only a fraction of the equivalent lattice sites 
available, and the ionic conductivity proceeds by the hopping of a silver ion 
from one site to a nearby vacant site. The crystal structures also have parallel 
open channels. 


PEIERLS INSTABILITY OF LINEAR METALS 

Consider a one-dimensional metal with an electron gas filling all conduc- 
tion band orbitals out to the wavevector k h , at absolute zero of temperature. 
Peierls suggested that such a linear metal is unstable with respect to a static 
lattice deformation of wavevector G — 2 k F . Such a deformation creates an en- 
ergy gap at the Fermi surface, thereby lowering the energy of electrons below 
the energy gap, Fig. 20 . The deformation proceeds until limited by the in- 
crease of elastic energy; the equilibrium deformation A is given by the root of 

J^electionic + ^clastic) = 0 • (66) 

Consider the elastic strain A cos 2 k F x. The spatial-average elastic energy 
per unit length is ^elastic = |CA 2 {cos 2 2& F x) = |CA 2 , where C is the force 
constant of the linear metal. We next calculate E t .i CT . trunic ,. Suppose that the ion 
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Energy gap introduced 



Wavevector 


Figure 20 Peierls instability. Electrons with wave- 
vectors near the Fermi surface have their energy 
lowered by a lattice deformation. 


contribution to the lattice potential seen by a conduction electron is propor- 
tional to the deformation: U(x) = 2AA cos 2 k F x. From (7.51) we have 

e K = (h 2 /2m)(kl + K 2 ) ± [ 4(h 2 k 2 F /2m)(h 2 K 2 /2m ) + A 2 A 2 ] 1/2 . (67) 

It is convenient to define 

x K = h 2 K 2 /m ; x F = h 2 k 2 F /m ; x = h 2 Kk F /m . 

We retain the — sign in (67) and form 

_ -A 2 A 

dA (x F x K + A 2 A 2 ) 1/2 ’ 

whence, with dKhr as the number of orbitals per unit length, 


dE 


electronic 


dA 


= —(2A 2 AJTr){k F /x, 


■»/" 
J 0 


dK 


^0 (x F x K + A 2 A 2 ) V2 


: = — (2A 2 A/Tr)(k F /x F ) sinh 1 (x F /AA) 


(x 2 + A 2 A 2 ) m 

We put it all together. The equilibrium deformation is the root of 
|CA — (2A 2 mA/Trh 2 k F ) smh ^Ifdkf/mAA) = 0 . 

The root A that corresponds to the minimum energy is given by 
h 2 kplmAA = sinh(—h 2 k F TrC/4mA 2 ) , 

whence 

|A | A — (2h 2 k F /m) exp(— h 2 k F irC/4mA 2 ) , 
if the argument of the sinh in (68) is S> 1. We assume k F =£ | k max . 


( 68 ) 

( 69 ) 



The result is of the form of the energy gap equation in the BCS theory of 
superconductivity, Chapter 10. The deformation A is a collective effect of all 
the electrons. If we set W = h 2 kp/2m — conduction band width; N( 0) = 
2m/'rrfi 2 k F = density of orbitals at Fermi level; V = 2A Z /C = effective electron- 
electron interaction energy, then we can write (69) as 

\A | A — 4W exp[ — LW(0)V] , (70) 

which is analogous to the BCS energy gap equation. An example of a Peierls 
insulator is TaS 3 . 


SUMMARY 

(In CGS units) 


• The dielectric function may be defined as 


e(a>, K) 


Pext(^K) 


in terms of the applied and induced charge density' components at oi,K. 


• The plasma frequency a> p — [4irne z /e{°°)m] i/2 is the frequency of the uniform 
collective longitudinal oscillation of the electron gas against a background of 
fixed positive ions. It is also the low frequency cutoff for propagation of 
transverse electromagnetic waves in the plasma. 

• The poles of the dielectric function define co T and the zeroes define co L . 

• In a plasma the coulomb interaction is screened; it becomes (q/r) exp(— k s r), 
where the screening length 1 /k s = (e F /6im n e z ) m . 

• A metal-insulator transition may occur when the nearest-neighbor separa- 
tion a becomes of the order of 4 a 0 , where a 0 is the radius of the first Bohr 
orbit in the insulator. The metallic phase exists at smaller values of a. 

• A polariton is a quantum of the coupled TO phonon-photon fields. The cou- 
pling is assured by the Maxwell equations. The spectral region u> T < w < (o L 
is forbidden to electromagnetic wave propagation. 

• The Lyddane-Sachs-Teller relation is <y z /<y 2 r = e(0)/e(oo). 


Problems 

1. Surface plasmons. Consider a semi-infinite plasma on the positive side of the 
plane z = 0. A solution of Laplaces equation V 2 < -p — 0 in the plasma is <Pi{x,z) = 
A cos kx e~ kz , whence E zi — kA cos kx c kz \ E :i = kA sin kx e kz . (a)Show that in the 
vacuum tp 0 (x,z) = A cos kx e ki for z < 0 satisfies the boundary' condition that the 
tangential component of E be continuous at the boundary; that is, find E x0 . 
(b) Note that l) l = e(o>)E f ; D„ = E,,. Show that the boundary condition that the 
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normal component of D be continuous at the boundary requires tbat e(co) = — 1, 
whence from (10) we have the Stern-Ferrell result: 

= (71) 

for the frequency of a surface plasma oscillation. 

2. Interface plasmons. We consider the plane interface z = 0 between metal 1 at 
z > 0 and metal 2 at z < 0. Metal 1 has bulk plasmon frequency aj pl ; metal 2 has 
w p 2 . The dielectric constants in both metals are those of frcc-electron gases. Show 
that surface plasmons associated with the interface have the frequency 

« = + ^ 2 )] 1/2 • 

3. Alfven waves. Consider a solid with an equal concentration n of electrons of mass 

m e and holes of mass m/ t . This situation may arise in a semimetal or in a compen- 
sated semiconductor. Place the solid in a uniform magnetic field B = B z. Intro- 
duce the coordinate £ = x + t y appropriate for circularly polarized motion, with t; 
having time dependence Let w e = eBhn.fi and a> h = e,B/m h c. (a) In CGS units, 
show that = eE '/m/m ' <» + o>j ; £/, = —eE* lm h a>(w — a> h ) are the displacements 
of the electrons and holes in the electric field E~ e~ wt = (E x + iE tJ ) e’ wt . 
(b) Show that the dielectric polarization P + = ne(f h — £ e ) in the regime a> (o„ aj h 
may be written as P 1 = nc 2 (m h + mJE + /B 2 , and the dielectric function e(ui) — 
e ( + 4 ttP + /E + = + 4rrc 2 p/B 2 , where is the dielectric constant of the host lat- 

tice and p = n(m e + iri/J is the mass density’ of tlie carriers. If e ( may be neglected, 
the dispersion relation a/e(to) = c 2 K 2 becomes, for electromagnetic waves propa- 
gating in the z direction, o> 2 = (B 2 /4vp)K 2 . Such waves are known as Alfven waves: 
they propagate with the constant velocity' B/Mirp) 1 ' 2 . If B — 10 kG; n = 10 ls cm' 3 ; 
to = KG 27 g, the velocity is ~10 8 cm s' 1 . Alfven waves have been observed in serni- 
metals and in electron-hole drops in germanium (Chapter 15). 

4. Helicon waves, (a) Employ the method of Problem 3 to treat a specimen with 
only one carrier type, say holes in concentration p, and in the limit w < ai h = 
eB/m h c. Show that e(a>) — 47rpe 2 /m ll axo ll , where D + ((o) = e(ti>)E + (&)). The term e. 
in e has been neglected, (b) Show further that the dispersion relation becomes w = 
( Bc/4irpe)K the helicon dispersion relation, in CGS. For K — 1 can' 1 and B = 
1000 G, estimate the helicon frequency in sodium metal. (The frequency is nega- 
tive; with circular-polarized modes the sign of the frequency refers to the sense of 
the rotation.) 

5. Plasmon mode of a sphere. The frequency of the uniform plasmon mode of a 
sphere is determined by the depolarization field E = — 47rP/3 of a sphere, where 
the polarization P= — ner, with r as the average displacement of the electrons of 
concentration n. Show from F = via that the resonance frequency of the electron 
gas is tv\ — 4Tme' l /3m. Because all electrons participate in the oscillation, such an 
excitation is called a collective excitation or collective mode of the electron gas. 

6. Magnetoplasma frequency. Use the method of Problem 5 to find the frequency 
of the uniform plasmon mode of a sphere placed in a constant uniform magnetic 
field B. Let B he along the Z axis. The solution should go to the cyclotron 
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frequency at,. = eB/mc in one limit and to a> 0 = (4ime 2 /3m) y2 in another limit. Take 
the motion in the xy plane. 

7. Photon branch at low wavevector. (a) Find what (56) becomes when e(°°) 
is taken into account, (b) Show that there is a solution of (55) which at low 
wavevector is a> = cK/Ve( 0), as expected for a photon in a crystal of refractive 
index n 2 = e. 

8. Plasma frequency and electrical conductivity. An organic conductor has been 
found by optical studies to have oi p — 1.80 X 10 is s -1 for the plasma frequency, 
and t — 2.83 X 1 0 5 s for the electron relaxation time at room temperature, 
(a) Calculate the electrical conductivity from these data. The carrier mass is not 
known and is not needed here. Take 6 ( 2 °) = I. Convert the result to units 
[fl cm) -1 , (b) From the crystal and chemical structure, the conduction electron 
concentration is 4.7 X 10 21 cm -3 . Calculate the electron effective mass m* . 

9 . Bulk modulus of the Fermi gas. Show that the contribution of the kinetic en- 
ergy to the bulk modulus of the electron gas at absolute zero is B = |ninc|. It is 
convenient to use (6.60). We can use our result for B to find the velocity of sound, 
which in a compressible fluid is v = (B/p) 1 -, where v — (m/3M) 1/2 v p , in agreement 
with (46). These estimates neglect attractive interactions. 

10 . Response of electron gas. It is sometimes stated erroneously in books on electro- 
magnetism that the static conductivity cr, which in gaussian units has the dimensions 
of a frequency, measures the response frequency of a metal to an electric field sud- 
denly applied. Criticize this statement as it might apply to copper at room tempera- 
ture. Tire resistivity is -lpohm-cm; the electron concentration is 8 X 10 22 cm -3 ; the 
mean free path is -400 A; the Fermi velocity is 1.6 X 10 s cm s -1 . You will not neces- 
sarily need all these data. Give the order of magnitude of the three frequencies cr, 
(o p , and 1 /t that might be relevant in the problem. Set up and solve the problem 
of the response x(t) of the system to an electric field Eit < 0) = 0; E(t > 0) = 1. 
The system is a sheet of copper; the field is applied normal to the sheet. Include the 
damping. Solve the differential equation by elementary methods. 

* 11 . Cap plasmons and the van der Waals interaction. Consider two semi-infinite 
media with plane surfaces z - 0, d. The dielectric function of the identical media 
is e(fti). Show that for surface plasmons symmetrical with respect to the gap the 
frequency must satisfy e(tu) = — tanh (Kd! 2), where K 2 — k 2 + k 2 . The electric po- 
tential will have the form 

tp — f(z) exp(ikjX + ik y y — iwt ) . 

Look for nonretarded solution — that is, solutions of the Laplace equation rather 
than of the wave equation. The sum of the zero-point energy of all gap modes 
is the nonretarded part of the van der Waals attraction between the two 
specimens — see N. G. van Kanipen, B. R. A. Nijboer, and K. Schram, Physics 
Letters 26A, 307 (1968). 


This problem is somewhat difficult. 
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Raman scattering (generic term): 
Brillouin scattering when acoustic 
phonon is involved; polariton 
scattering when optical phonon 
is involved. 



+ for phonon emission (Stokes process) 
- for phonon absorption (anti-Stokes) 



Two phonon creation. 


Electron spectroscopy with x-rays (XPS): 

incident x-ray photon ejects valence or 
core electron from solid. 


Figure X There are many types of experiments in which light interacts with wavelike excitations 
in a crystal. Several absorption processes are illustrated here. 
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CHAPTER 15 : OPTICAL PROCESSES AND EXCITONS 


The dielectric function c(w,K) was introduced in the preceding chapter to 
describe the response of a crystal to an electromagnetic field (Fig. 1). The di- 
electric function depends sensitively on the electronic band structure of a 
crystal, and studies of the dielectric function by optical spectroscopy are very 
useful in the determination of the overall hand structure of a crystal. Indeed, 
optical spectroscopy has developed into the most important experimental tool 
for band structure determination. 

In the infrared, visible, and ultraviolet spectral regions the wavevector of 
the radiation is very small compared with the shortest reciprocal lattice vector, 
and therefore it may usually be taken as zero. We are concerned then with the 
real e' and imaginary e" parts of the dielectric function at zero wavevector; 
e(to) = e'(to) + te"(w), also written as + ie 2 (a>). 

However, the dielectric function is not directly accessible experimentally 
from optical measurements: the directly accessible functions are the reflect- 
ance R(o>), the refractive index n(a>), and the extinction coefficient K((d). Our 
first objective is to relate the experimentally observable quantities to the real 
and imaginary parts of the dielectric function. 


OPTICAL REFLECTANCE 


The optical measurements that give the fullest information on the elec- 
tronic system are measurements of the reflectivity of light at normal incidence 
on single crystals. The reflectivity coefficient r(cu) is a complex function de- 
fined at the crystal surface as the ratio of the reflected electric field E(rcfl) to 
the incident electric field E(inc): 


E(refl)/E(inc) = r(w) = p(co) exp[»0(w)] , (1) 


where we have separated the amplitude p(a>) and phase <?(«) components of 
the reflectivity coefficient. 

The refractive index n(o)) and the extinction coefficient K(a>) in the 
crystal are related to the reflectivity' at normal incidence by 


r(co) 


n + iK- 1 
n + iK + 1 ’ 


( 2 ) 


as derived in Problem 3 by elementary consideration of the continuity of the 
components of E and B parallel to the crystal surface. By definition, n(o>) and 
K(a>) are related to the dielectric function e(ta) by 

Vefw) = n(co) + iK(co) — N(w) , (3) 
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where N(to) is the complex refractive index. Do not confuse K(w) as used here 
with the symbol for a wavevector. 

J 

If the incident traveling wave has the wavevector k , then the ij component 
of a wave traveling in the x direction is 

f^finc) = E tj n exp[»(fct - wt)\ . (4) 

The transmitted wave in the medium is attenuated because, by the dispersion 
relation for electromagnetic waves, the wavevector in the medium is related to 
the incident k in vacuum by (n + iK)k\ 

E y (trans) oc exp {[i[(n + iK)kx - wt ]} = exp(-Kfct) exp[i{nfcc - <at)] . (5) 

One quantity measured in experiments is the reflectance R, defined as 
the ratio of the reflected intensity to the incident intensity: 

R = £*(refl)F(refl)/E*(inc)E(inc) = r*r = p 2 . (6) 

It is difficult to measure the phase Oioj) of the reflected wave, but we show 
below that it can be calculated from the measured reflectance R((o ) if this is 
known at all frequencies. 

Once we know both R(w) and 8(a>), we can proceed by (2) to obtain n(w) 
and K(a>). We use these in (3) to obtain e(ca) = e'(ci>) + ie"(co ), where e'(<o ) and 
e"(<u) are the real and imaginary parts of the dielectric function. The inversion 
of (3) gives 

e'(a>) = n 2 - K 2 ; e"(co) = 2nK , (7) 

We now show how to find the phase B(w) as an integral over the re- 
flectance ft(cu); by a similar method we relate the real and imaginary parts of 
the dielectric function. In this way we can find everything from the experi- 
mental R(io). 

Kramers-Kronig Relations 

The Kramers-Kronig relations enable us to find the real part of the re- 
sponse of a linear passive system if we know the imaginary part of the response 
at all frequencies, and vice versa. They are central to the analysis of optical 
experiments on solids. 

The response of any linear passive system can be represented as the su- 
perposition of the responses of a collection of damped harmonic oscillators 
with masses Mj. Let the response function a(a)) = a'(oj) + i<x"(a> ) of the col- 
lection of oscillators be defined by 

x,„ = a(<o)F,„ , (8) 

where the applied force field is the real part of F 0J exp (—iatt) and the total dis- 
placement x = ^ Xj is the real part of x a exp (—icot). From the equation of motion, 

i 

Mj(d 2 /dt 2 + pjd/dt + (of)xj — F , 
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we have the complex response function of the oscillator system: 


*(w) = 2 


fj 


<Oj — w 2 


w Pj 




OJ + Uopj 


(wj — co 2 ) 2 + arpf 


(9) 


where the constants / = \/M j and relaxation frequencies pj are all positive for 
a passive system. 

If a(co) is the dielectric polarizability of atoms in concentration u, then/ 
has the form of an oscillator strength times ne 2 /m- such a dielectric response 
function is said to be of the Kramers-Heisenberg form. The relations wc de- 
velop also apply to the electrical conductivity u{(o) in Ohm’s law, j a = a(w)E u . 

We need not assume the specific form (9), but wc make use of three prop- 
erties of the response function viewed as a function of the complex variable co. 
Any function with the following properties will satisfy the Kramers-Kronig 
relations (11): 

(a) The poles of a (co) are all below the real axis. 

(b) The integral of a(co)/co vanishes when taken around an infinite semi- 
circle in the upper half of the complex to-plane. It suffices that a(co') — > 0 uni- 
formly as I ml — t =o. 

(c) The function a’(co) is even and a”(<o) is odd with respect to real co. 

Consider the Cauchy integral in the form 

= p J_ a> ^l cls ■ (10) 


where P denotes the principal part of the integral, as discussed in the mathe- 
matical note that follows. The right-hand side is to be completed by an integral 
over the semicircle at infinity in the upper half-plane, but we have seen in (b) 
that this integral vanishes. 

We equate the real parts of (10) to obtain 



a"(s) 

S — CO 




a"(s) 

S— CO 


ds + 


0 a"(p) 

— oc r 


In the last integral we substitute s for — p and use property (c) that a"(—s) = 
— a"(.s); this integral then becomes 


and we have, with 



«"(s) 
s + co 


ds , 


1 + 1 = 2s 

S — to S + ft) s 2 - co 2 
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the result 



(Ha) 


This is one of the Kramers-Kronig relations. The other relation follows on 
equating the imaginary parts of Eq. (10): 


a"{<a) = 
whence 


-ip 

7T I S “ 

J CO 


“ «'(s) , 1 p 

C O ds= -Tt? 


r a’(s) [°° a'(s 






r «'(») 

•to s 2 — w 2 


ds . 


(lib) 


These relations are applied below to the analysis of optical reflectance data; 
this is their most important application. 

Let us apply the Kramers-Kronig relations to r(&>) viewed as a response func- 
tion between the incident and reflected waves in (1) and (6). We apply (11) to 

In r(w) = In R^w) + id(co) (12) 


to obtain the phase in terms of the reflectance: 

In R(s) 


0(iu) = -fP 


f 

J a 


o s 2 - ta- 


ds . 


(13) 


We integrate by parts to obtain a form that gives insight into the contribu 
tions to the phase angle: 

I d In R{s 


6((d) ~ - 


=~l r 

2tt J 0 


f\ 

S + 10 

Jo 

S CO 


ds 


-ds . 


(14) 


Spectral regions in which the reflectance is constant do not contribute to the 
integral; further, spectral regions s > w and s < <o do not contribute much be- 
cause the function In l(s + co)/(s — <w)l is small in these regions. 

Mathematical Note. To obtain the Cauchy integral (10) we take the inte- 
gral /«(.?)(« — <u) 1 ds over the contour in Fig. 2. The function a(s) is analytic in 
the upper half-plane, so that the value of the integral is zero. The contribution 
of segment (4) to the integral vanishes if the integrand a(s)/s — > 0 is faster than 
Is I -1 as Isl — > oo. For the response function (9) the integrand — > 0 as Isl -3 : and 
for the conductivity cr(s) the integrand — t 0 as Isl -2 . The segment (2) con- 
tributes, in the limit as it — t 0, 


f a(s) 

J( 2) S_W 


ds 



m e 


w dQ 


u e 
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Figure 2 Contour for the Cauchy principal value 
integral. 


to the integral, where 5 = w + u e’ 9 . The segments (1) and (3) are by definition 
the principal part of the integral between and °° . Because the integral 
over (1) + (2) + (3) + (4) must vanish, 

f + f = P f = > (15) 

J (1) J(3) 

as in (10). 


EXAMPLE: Conductivity of Collisionless Electron Gas. Consider a gas of free 
electrons in the limit as the collision frequency goes to zero. From (9) the response 
function is, with/ = 1/m, 


a(w) 


J_ 1 

nuo p—o to + ip 


1 

nuo 


^ - inS(to) 


(16) 


hy the Dirac identity. We confirm that the delta function in (16) satisfies the Kramers- 
Kronig relation (11a), by which 


a '(a) 


2 _ )_, = __1 
m Jo s 2 - to 2 ' ma 


in agreement with (16). 

We obtain the electrical conductivity a(to) from the dielectric function 


(17) 


e(a>) — 1 — 4 TrPJE a — —4 rrnexJE^ = 47 me 2 a((o) , (18) 

w r here cr(to) = xJ(—e)E a is the response function. We use the equivalence 
(CCS) rr(to) = (-ito/47r)|e(to) - 1] , (19) 


for the Maxwell equation can be written either as c curl H = 47rcr(to)E — icoE or as 
c curl H = — itoe(to)E. We combine (16), (18), and (19) to find the conductivity of a 
collisionless electron gas: 


tr'(to) + i<r"(to) = 


7rS(to) + j- 


( 20 ) 


For collisionless electrons the real part of the conductivity has a delta function at to = 0. 
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Electronic Interband Transitions 

It came as a surprise that optical spectroscopy developed as an important 
experimental tool for the determination of hand structure. First, the absorp- 
tion and reflection bands of crystals are broad and apparently featureless func- 
tions of the photon energy when this is greater than the band gap. Second, 
direct interband absorption of a photon fiu) will occur at all points in the Bril- 
louin zone for which energy is conserved: 

hw = e c (k) - e„(k) , (21) 

where c is an empty' band and v is a filled band. The total absorption at given u> 
is an integral over all transitions in the zone that satisfy (21). 

Three factors unraveled the spectra: 

• The broad bands are not like a spectral line greatly broadened by damping, 
but the bands convey much intelligence which emerges when derivatives are 
taken of the reflectance (Fig. 3); derivatives with respect to wavelength, 
electric field, temperature, pressure, or uniaxial stress, for example. The 
spectroscopy of derivatives is called modulation spectroscopy. 



Figure 3 Comparison of (a) reflectance, (b) wavelength derivative reflectance (first derivative), 
and (c) electroreflectance (third derivative), of the spectral region in germanium between 3.0 and 
3.6 eV. (After data by D. D. Sell, E. O. Kane, and D. E. Aspnes.) 
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• The relation (21) does not exclude spectral structure in a crystal, because 
transitions accumulate at frequencies for which the bands c, v are parallel — 
that is, at frequencies where 

V k [e c (k) - e 0 (k)] = 0 . (22) 

At these critical points in k space the joint density of states D c (e v + 
hco)D^(e v ) is singular, according to the same argument we used in (5.37) to 
show that the density of phonon modes D{a>) is singular when V k w is zero. 

• The pseudopotential method for calculating energy bands helps identify the 
positions in the Brillouin zone of the critical points found in modulation 
spectra. Band-band energy differences can be calculated with an accuracy as 
good as 0.1 eV. The experimental results can then be fed back to give im- 
provements in the pseudopotential calculations. 


EXCITONS 

Reflectance and absorption spectra often show structure for photon 
energies just below the energy gap, where we might expect the crystal to be 
transparent. This structure is caused by the absorption of a photon with the 
creation of a bound electron-hole pair. An electron and a hole may be bound 
together by their attractive coulomb interaction, just as an electron is bound to 
a proton to form a neutral hydrogen atom. 

The bound electron-hole pair is called an exciton. Fig. 4. An exciton can 
move through the crystal and transport energy; it does not transport charge 



Figure 4a An exciton is a bound electron-hole pair, 
usually free to move together through the crystal. In 
some respects it is similar to an atom of positronium, 
formed from a positron and an electron. The exciton 
shown is a Mott-Wannier exciton: it is weakly bound, 
with an average electron-hole distance large in com- 
parison with the lattice constant. 



Figure 4b A tightly-bound or Frenkel exciton shown local- 
ized on one atom in an alkali halide crystal. An ideal F renkel 
exciton will travel as a wave throughout the crystal, but the 
electron is always close to the hole. 
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because it is electrically neutral. It is similar to positroninm, which is formed 
from an electron and a positron. 

Excitons can be formed in every insulating crystal. When the band gap is 
indirect, excitons near the direct gap may be unstable with respect to decay 
into a free electron and free hole. All excitons are unstable with respect to the 
ultimate recombination process in which the electron drops into the hole. Ex- 
citons can also form complexes, such as a hiexciton from two excitons. 

We have seen that a free electron and free hole are created whenever a 
photon of energy greater than the energy gap is absorbed in a crystal. The 
threshold for this process is fioi > £„ in a direct process. In the indirect 
phonon-assisted process of Chapter 8 the threshold is lower by the phonon en- 
ergy hCi. But in the formation of excitons the energy is lowered with respect to 
these thresholds by the binding energy of the exciton, which may be in the 
range 1 meV to 1 eV, as in Table 1 . 

Excitons can be formed by photon absorption at any critical point (22), for 
if V k e c = V k € c the group velocities of electron and hole are equal and the parti- 
cles may be bound by their coulomb attraction. Transitions leading to the 
formation of excitons below the energy gap are indicated in Figs. 5 and 6. 

The binding energy' of the exciton can be measured in three ways: 

• In optical transitions from the valence band, by the difference between the 
energy' required to create an exciton and the energy to create a free electron 
and free hole. Fig. 7. 

• In recombination luminescence, by comparison of the energy of the free 
electron-hole recombination line with the energy of the exciton recombina- 
tion line. 

• By' photo-ionization of excitons, to form free carriers. This experiment re- 
quires a high concentration of excitons. 

We discuss excitons in two different limiting approximations, one by Frenkel 
in which the exciton is small and tightly bound, and the other by Mott and 
Wannier in w'hich die exciton is weakly bound, with an electron-hole separation 
large in comparison with a lattice constant. Intermediate examples are known. 


Table I Binding energy of excitons, in meV 


Si 

14.7 

BaO 

56. 

RbCl 

440. 

Ge 

4.15 

InP 

4.0 

LiF 

(1000) 

GaAs 

4.2 

InSb 

(0.4) 

AgBr 

20. 

GaP 

3.5 

KI 

480. 

AgCl 

30. 

CdS 

29. 

KC1 

400. 

T1C1 

11. 

CdSe 

15. 

KBr 

400. 

TIBr 

6. 


Data assembled by Frederick C. Brown and Arnold Schmidt. 
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Figure 5 Exciton levels in relation to the conduction band edge, for a simple band structure with 
both conduction and valence band edges at k = 0. An exciton can have translational kinetic en- 
ergy. Excitons are unstable with respect to radiative recombination in which the electron drops 
into the hole in the valence band, accompanied by the emission of a photon or phonons. 


Conduction band continuum 



Figure 6 Energy levels of an exciton created in a direct process. Optical transitions from the top 
of the valence band are shown by the arrows; the longest arrow corresponds to the energy gap. 
The binding energy of the exciton is E ex , referred to a free electron and free hole. The lowest fre- 
quency absorption line of the crystal at absolute zero is not E ex> but is — E„. 


Frenkel Excitons 

In a tightly bound exciton (Fig. 4b) the excitation is localized on or near a 
single atom: the hole is usually on the same atom as the electron although the 
pair may be anywhere in the crystal. A Frenkel exciton is essentially an excited 
state of a single atom, but the excitation can hop from one atom to another by 
virtue of the coupling between neighbors. The excitation wave travels through 
the crystal much as the reversed spin of a magnon travels through the crystal. 

The crystalline inert gases have excitons which in their ground states cor- 
respond somewhat to the Frenkel model. Atomic krypton has its lowest strong 
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Figure 7 Effect of an exciton level on the optical absorption of a semiconductor for photons of 
energy near the band gap F. g in gallium arsenide at 21 K. The vertical scale is the intensity absorp- 
tion coefficient a, as in Z(x) = J 0 exp(— ax). The energy gap and exciton binding energy are 
deduced from the shape of the absorption curve: the gap E g is 1.521 eV and the exciton binding 
energy is 0.0034 eV. (After M. D, Sturge.) 


atomic transition at 9.99 eV. The corresponding transition in the crystal is 
closely equal and is at 10.17 eV, Fig. 8. The energy gap in the crystal is 11.7 eV, 
so the exciton ground state energy is 11.7 — 10.17 = 1.5 eV, referred to a free 
electron and free hole separated and at rest in the ciystal. 

The translational states of Frenkel excitons have the form of propagating 
waves, like all other excitations in a periodic structure. Consider a crystal of 
N atoms on a line or ring. If u, is the ground state of atomy', the ground state of 
the crystal is 

4*g = U^U-2 ‘ * * Up] _ yUjY , (23) 

if interactions between the atoms are neglected. If a single atom j is in an 
excited state Vj, the system is described by 

<Pj = u,u 2 ■ ■ ■ Uj^vjUj+t ■ ■ ■ u N . (24) 

This function has the same energy' as the function <p ; with any other atom 1 ex- 
cited. However, the functions <p that describe a single excited atom and 
IV — 1 atoms in their ground state are not the stationary quantum states of the 
problem. If there is any interaction between an excited atom and a nearby 
atom in its ground state, the excitation energy' will be passed from atom to 
atom. The eigenstates will have a wavelike form, as we now show. 
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13.0 12.0 11.0 10.0 

h<i>, eV 

Figure 8 Absorption spectrum of solid lcrypton at 20 K. (After G. Baldini.) 


When the hamiltonian of the system operates on the function cpj with the 
jth atom excited, we obtain 

W<Pf = e<pj + T(<pj- L + <p j+1 ) , (25) 

where e is the free atom excitation energy; the interaction T measures the rate 
of transfer of the excitation from j to its nearest neighbors, j — 1 and 
j + 1. Tbe solutions of (25) are waves of the Bloch form: 

= S exp (ijka) <pj . (26) 

j 

To see this we let operate on t)j k : 

Mh = 2 = 2 e ^ [etpj + n<P,-i + «Pj+i)] , (27) 

J J 

from (25). We rearrange the right-hand side to obtain 

= 2 e iJka U + T(e ika + e _,fe, )]<p ; = (e + 2 T cos ka)*p k , (28) 

j 

so that the energy eigenvalues of the problem are 


E k — e + 2T cos ka , 


(29) 
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Energy 



Figure 9 Energy versus wavevector for a Frenkel exciton, calculated with positive nearest- 
neighbor transfer interaction T. 


as in Fig. 9. The application of periodic boundary conditions determines the 
allowed values of the wavevector k: 

k = 2ws/Na- s = N, -±A r + 1, ■ • • , §N - 1 . (30) 

.Alkali Halides. In alkali halide crystals the lowest-energy excitons are local- 
ized on the negative halogen ions, as in Fig, 4b. The negative ions have lower 
electronic excitation levels than do the positive ions. Pure alkali halide crystals 
are transparent in the visible spectral region, which means that the exciton en- 
ergies do not lie in the visible, but the crystals show considerable excitonic 
absorption structure in the vacuum ultraviolet. 

A doublet structure is particularly evident in sodium bromide, a structure 
similar to that of the lowest excited state of the krypton atom — which is iso- 
electronic with the Br~ ion of KBr. The splitting is caused by the spin-orbit in- 
teraction. These excitons are Frenkel excitons. 

Molecular Crystals. In molecular crystals the covalent binding within a 
molecule is strong in comparison with the van der Waals binding between mol- 
ecules, so that the excitons are Frenkel excitons. Electronic excitation lines of 
an individual molecule appear in the crystalline solid as an exciton, often with 
little shift in frequency. At low' temperatures the lines in the solid are quite 
sharp, although there may be more structure to the lines in the solid than in 
the molecule because of the Davydov splitting, as discussed in Problem 7. 
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Weakly Bound (Mott-Wannier) Excitons 

Consider an electron in the conduction band and a hole in the valence 
band. The electron and hole attract each other by the coulomb potential 

(CCS) U(r) = -e 2 /er , (31) 

where r is the distance between the particles and e is the appropriate dielec- 
tric constant. (The lattice polarization contribution to the dielectric constant 
should not be included if the frequency of motion of the exciton is higher than 
the optical phonon frequencies.) There will be bound states of the exciton sys- 
tem having total energies lower than the bottom of the conduction band. 

The problem is the hydrogen atom problem if the energy surfaces for the 
electron and hole are spherical and nondegenerate. The energy levels referred 
to the top of the valence band are given by a modified Rydberg equation 


(CGS) 


‘ g 2 hW 


(32) 


Here n is the principal quantum number and /a is the reduced mass: 

I = JL + -i- 

M m e m h ’ 


(33) 


formed from the effective masses m,„ m h of the electron and hole. 

The exciton ground state energy is obtained on setting n = 1 in (32); this 
is the ionization energy of the exciton. Studies of the optical absorption lines 
in cuprous oxide, Cu a O, at low temperatures give results for the exciton level 
spacing in good agreement with the Rydberg equation (32) except for transi- 
tions to the state n = 1. An empirical fit to the lines of Fig. 10 is obtained with 
the relation u(cm _1 ) = 17,508 — (800/n 2 ). Taking e = 10, we find ji —0.7 m 
from the coefficient of 1/n 2 . The constant term 17,508 cm 1 corresponds to an 
energy' gap E g = 2.17 eV. 


Exciton Condensation into Electron-Hole Drops (EHD) 

A condensed phase of an electron-hole plasma forms in Ge and Si when 
maintained at a low temperature and irradiated by light. The following sequence 
of events takes place when an electron-hole drop (EHD) is formed in Ge: The 
absorption of a photon of energy fuo > E„ produces a free electron and free hole, 
with high efficiency. These combine rapidly, perhaps in 1 ns, to form an exciton. 
The exciton may decay with annihilation of the e-h pair with a lifetime of 8 /as. 

If the exciton concentration is sufficiently high — over 10 13 cm -3 at 2 K — 
most of the excitons will condense into a drop. The drop lifetime is 40 /as, but 
in strained Ge may be as long as 600 /as. Within the drop the excitons dissolve 
into a degenerate Fermi gas of electrons and holes, with metallic properties: 
this state was predicted by L. V. Keldysh. 
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Photon energy in eV 

2.12 2.13 2.14 2.15 2.16 



Figure 10 Logarithm of the optical transmission versus photon energy in cuprous oxide at 77 K, 
showing a series of exciton lines. Note that on tire vertical axis the logarithm is plotted decreasing 
upward; thus a peak corresponds to absorption. The band gap E g is 2.17 eV. (After P. W. Jlaumeister.) 



fui), in meV 


Figure 11 Recombination radiation of free electrons with holes and of electron-hole drops in Ge 
at 3.04 K. The Fermi energy in the drop is e e and the cohesive energy of the drop with respect to a 
free cxcitonis <]>,. (After T, K. Lo.) 


Figure 11 shows the recombination radiation in Ge from free excitons 
(714 meV) and from the EHD phase (709 meV). The width of the 714 meV 
line is accounted for by Doppler broadening, and the width of the 709 meV 
line is compatible with the kinetic energy distribution of electrons and holes in 
a Fermi gas of concentration 2 X 10 17 cm -3 . Figure 12 is a photograph of a 
large EHD. 
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Figure 12 Photograph of an electron-hole drop in a 4 mm disk of pure germanium. The drop is 
the intense spot adjacent to the set screw on the left of the disk. The photograph is the image of 
the drop obtained by focusing its electron-hole recombination luminescence onto the surface of 
an infrared-sensitive camera. (After J. P. Wolfe et al.) 



Figure 13 Phase diagram for photoexcited electrons and holes in unstressed silicon. The diagram 
shows, for example, that with an average concentration near 10 17 cm -3 at 15 K, a free-exciton gas 
with saturated-gas concentration of 10 16 cm' 3 coexists with a (variable) volume of liquid droplets, 
each with a density of 3 X 10 18 cm -3 . The liquid critical temperature is about 23 K. Theoretical and 
experimental values for the metal-insulator transition for excitons are also shown. (From J. P. Wolfe.) 
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Table 2 Electron-hole liquid parameters 


Crystal 

(unstressed) 

Binding energy relative 
to free exciton, in meV 

Concentration, 
n or p, in cm 3 

Critical 

temperature in K 

Ge 

1.8 

2.57 X 10 17 

6.7 

Si 

9.3 

3.5 X 10 18 

23. 

GaP 

17.5 

8.6 X 10 18 

45. 

3C-SiC 

17. 

10. X 10 18 

41. 


Courtesy of D. Bimberg. 


The exciton phase diagram for silicon is plotted in the temperature- 
concentration plane in Fig. 13. The exciton gas is insulating at low pressures. 
At high pressures (at the right of the diagram) the exciton gas breaks up into a 
conducting plasma of unpaired electrons and holes. The transition from exci- 
tons to the plasma is an example of the Mott transition, Chapter 14. Further 
data are given in Table 2. 


RAMAN EFFECT IN CRYSTALS 

Raman scattering involves two photons — one in, one out — and is one step 
more complex than the one photon processes treated earlier in this chapter. In 
the Raman effect a photon is scattered inelastically by a crystal, with creation 
or annihilation of a phonon or magnon (Fig. 14). The process is identical to the 
inelastic scattering of x-rays and it is similar to the inelastic scattering of 
neutrons by a crystal. 

The selection rides for the first-order Raman effect are 

co = w’±fl ; k = k' ± K , (34) 

where w, k refer to the incident photon; o>', k' refer to the scattered photon; 
and fl, K refer to the phonon created or destroyed in the scattering event. In 
the second-order Raman effect, two phonons are involved in the inelastic 
scattering of the photon . 

The Raman effect is made possible by the strain-dependence of the elec- 
tronic polarizability. To show this, we suppose that the polarizabdity a associ- 
ated with a phonon mode may be written as a power series in the phonon 
amplitude u : 

a = a 0 + a + a 2 u 2 + • ■ • . (35) 

If u(t) = u 0 cos fit and the incident electric field is E(t) = E 0 cos cot, then the 
induced electric dipole moment has a component 

ajEoWo cos cot cos fit — |a-,E 0 M n [cos(<u + ft) l + cos(&> — fl)f] . (36) 
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Figure 14 Raman scattering of a 
photon with emission or absorp- 
tion of a phonon. Similar processes 
occur with magnons (spin waves). 


Thus photons at frequencies w + Cl and a> — can be emitted, accompanied 
by absorption or emission of a phonon of frequency U. 

The photon at o> — fi is called the Stokes line and that at u> + Cl is the 
anti-Stokes line. The intensity of the Stokes line involves the matrix element 
for phonon creation, which is just the matrix element for the harmonic oscilla- 
tor, as in Appendix C: 

l((o — Cl) <x |{n K + l|u|n K )| 2 cc n K + 1 , (37) 

where n K is the initial population of phonon mode K. 

The anti-Stokes line involves phonon annihilation, with a photon intensity 
proportional to 

I((v + fl) a; |(n K - 1 1 u |n K ) | 2 ri K . (38) 

If the phonon population is initially in thermal equilibrium at temperature 
T, the intensity ratio of the two lines is 

I(ce + 0)_ { n K ) 

y, — TTT ~ exp {-nCl/k B T) , (39) 

l(a > — il) (n K ) + 1 

with (n K ) given by the Planck distribution function l/[exp(fi.fl/k B T) — 1], We 
see that the relative intensity of the anti-Stokes lines vanishes as T — » 0, be- 
cause here there are no thermal phonons available to be annihilated. 

Observations on the K = 0 optical phonon in silicon are shown in Figs. 15 
and 1 6. Silicon has two identical atoms in the primitive cell, and there is no 
electric dipole moment associated with the primitive cell in the absence of 
deformation by phonons. But a { u does not vanish for silicon at K = 0, so that 
we can observe the mode by first-order Raman scattering of light. 

The second-order Raman effect arises from the term a 2 « 2 in the polariz- 
ability. Inelastic scattering of light in this order is accompanied by the creation 
of two phonons, or the absorption of two phonons, or the creation of one and 
the absorption of another phonon. The phonons may have different frequen- 
cies. The intensity distribution in the scattered photon spectrum may be quite 
complicated if there are several atoms in the primitive cell because of the 
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Figure 15 First-order Raman spectra of the K — 0 
optical mode of a silicon crystal observed at three 
temperatures. The incident photon has a wavelength 
of 5145 A. The optical phonon frequency is equal to 
the frequency shift; it depends slightly on the temper- 
ature. (After T. R. Hart, R. L. Aggarwal, and R. Lax.) 



Figure 16 Intensity ratio of anti-Stokes to Stokes 
lines as a function of temperature, for the obser- 
vations of Fig. 15 on the optical mcde of silicon. 
The observed temperature dependence is in good 
agreement with the prediction of Eq. (39): the 
solid curve is a plot of the function exp( — hil/k B T). 



Raman shift, cm 1 

Figure 17 Raman spectrum of GaP at 20 K. The two highest peaks are the first-order Raman 
lines associated with the excitation of an LO phonon at 404 cm -1 and a TO phonon at 366 cm -1 . 
All the other peaks involve two phonons. (After M. V. Hobden and J. P. Russell.) 
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corresponding number of optical phonon modes. Second-order Raman spectra 
have been observed and analyzed in numerous crystals. Measurements on 
gallium phosphide (GaP) are shown in Fig. 17. 

Electron Spectroscopy with X-Rays 

The next degree of complexity in optical processes involves a photon in 
and an electron out of the solid, as in Fig. 1. Two important techniques are 
x-ray photoemission from solids (XPS) and ultraviolet photoemission (UPS). In 
solid state physics they are used in band structure studies and surface physics, 
including catalysis and adsorption. 

XPS and UPS spectra can he compared directly with valence band densi- 
ties of states D(e). The specimen is irradiated with highly monochromatic 
x-rays or ultraviolet photons. The photon is absorbed, with the emission of a 
photoelectron whose kinetic energy is equal to the energy of the incident pho- 
ton minus the binding energy of the electron in the solid. The electrons come 
from a thin layer near the surface, typically 50 A in depth. The resolution of 
the best XPS spectrometer systems is less than 10 meV, which permits refined 
studies of band structure. 

The valence band structure of silver is shown by Fig. 18, with the zero of 
energy set at the Fermi level. Electrons in the first 3 eV below’ the Fermi level 
come from the 5s conduction band. The strong peak with structure below 3 eV 
is from the 4 d valence electrons. 

Excitations are also seen from deeper levels, often accompanied by excita- 
tion of plasmons. For example, in silicon the 2 p electron with a binding energy 
close to 99.2 eV is observed in replica at 117 eV with single plasmon excitation 
and at 134.7 eV with two plasmon excitation. The plasmon energy is 18 eV. 



Figure 18 Valence-band electron emission from 
silver, after Siegbahn and co-workers. 
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ENERGY LOSS OF FAST PARTICLES IN A SOLID 

So far we have used photons as probes of the electronic structure of solids. 
We can also use electron beams for the same purpose. The results also involve 
the dielectric function, now through the imaginary part of The dielec- 

tric function enters as Im{e(o>)} into the energy loss by an electromagnetic 
wave in a solid, but as — I rn { 1/e'co) } into the energy loss by a charged particle 
that penetrates a solid. 

Consider this difference. The general result from electromagnetic theory 
for the power dissipation density by dielectric losses is 

(CGS) ~ E • ijSDidt) , (40) 

per unit volume. With a transverse electromagnetic wave Ee~ ,art in the crystal, 
we have dD/dt = -i(o e(w)Ee _, ‘ 0 ', whence the time-average power is 

4 7T 

= - ' - «E*{(e"cos uit - e'sin wt)cos cat) = ~ (oe"(ct))E 2 , (41) 

4 TT 077 

proportional to e"(u>). The tangential component of E is continuous across the 
boundary of the solid. 

If a particle of charge e and velocity v enters a crystal, the dielectric dis- 
placement is, by standard texts, 

(CGS) D(r,t) = -grad ^ — - — - , (42) 

|r — vf | 

because by the Poisson equation it is D, and not E, that is related to the free 
charge. In an isotropic medium the Fourier component E(cu,k) is related to 
the Fourier component D(w,k) of D(r,t) by E(«,k) = E>(&>,k)/e(«,k), 

The time-average power dissipation associated with this Fourier compo- 
nent is 


^(ca.k) = -p-{Re{e '(w.klE^cudOe “'[Ref — tea D(cu,k) e “’*}) 


= - — (o D\u>, k) 

477 



COS (lit + 



sin u)t 


[ — sin tot ] 


whence 


9>(<u,k) 



rf(toM) 


e"(w,k) 
877 W | e | 2 


£> 2 (<u,k) . 


(43) 
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Figure 19 e"(w) for Cu and Au; the bold lines are from energy loss measurements by J. Daniels, and 
the other lines were calculated from optical measurements by D. Beaglehole, and L. R. Canfield ct al. 


The result is the motivation for the introduction of the energy loss function 
— Iin{l/e(ft),k)} and it is also a motivation for experiments on energy losses by 
fast electrons in thin films. 

If the dielectric function is independent of k, the power loss is 

SP(cu) = ~^-\m{l/e((o)}\ri(k 0 v/<o) , (44) 

where hk a is the maximum possible momentum transfer from the primary par- 
ticle to an electron of the crystal. Figure 19 shows the excellent experimental 
agreement between values of e"(a>) deduced from optical reflectivity measure- 
ments with values deduced from electron energy loss measurements. 


SUMMARY 

• The Kramers- Kronig relations connect the real and imaginary parts of a 
response function: 



• The complex refractive index N(w) — n(a>) + iK(o)), where n is the refrac- 
tive index and K is the extinction coefficient; further, e(a>) = N 2 ((o), whence 
e'(o>) — n 2 — K 2 ande"(&>) = 2 nK. 

• The reflectance at normal incidence is 

D _ (n- 1) 2 + K 2 


(n + l) 2 + K 2 ' 
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The energy loss function — Im{l/e(w)} gives the energy loss by a charged 
particle moving in a solid. 


Problems 

1. Causality and the response function. The Kramers-Kronig relations are consis- 
tent with the principle that an effect not precede its cause. Consider a delta- 
function force applied at time t = 0: 

"W-W'IS />“'*« ■ 

whence E„ = 1/2 tt. (a) Show by direct integration or by use of the KK relations that 
the oscillator response function 

a(at) — (o>q — at 2 — iotp)' 1 

gives zero displacement, x(t) = 0, for t < 0 under the above force. For t < 0 
the contour integral may be completed by a semicircle in the upper half-plane, 
(b) Evaluate x(t) for t > 0. Note that a{at) has poles at ±(a>f> — \p 2 ) V2 — lip, both 
in the lower half-plane. 

2. Dissipation sum rule. By comparison of a'((o) from (9) and from (11a) in the limit 

at -4 , show that the following sum rule for the oscillator strengths must hold: 


2fj = f sa”{s) ds . 

J o 

3. Reflection at normal incidence. Consider an electromagnetic wave in vacuum, 
with field components of the form 

E s (inc) = B z ( inc) = Ae i(fa -“ f! . 

Let the wave be incident upon a medium of dielectric constant e and permeability 
p = 1 that fills the half-space x > 0. Show that the reflectivity coefficient r(at) as 
defined by E(refl) — r(&j)E(inc) is given by 


r(t<>) = 


n + iK — 1 
n + iK +1 ’ 


where n + iK = e 1 '", with n and K real. Show further that the reflectance is 


Riot) 


(: n -l ) 2 + K 2 
{n +1) 2 + K 2 


*4. Conductivity sum rule and superconductivity. We write the electrical con- 
ductivity' as cr((o) = cr'(at ) + icr"(a>), where a', cr" are real, (a) Show by a Kramers- 
Kronig relation that 


lim at it" 



o-'(s) ds . 


(45) 


’This problem is somewhat difficult. 
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This result is used in the theory of superconductivity. If at very high frequencies 
(such as x-ray frequencies) tj"(a>) is identical for the superconducting and normal 
states, then we must have 



But at frequencies 0 < to < o>„ within the superconducting energy gap the real part 
of the conductivity of a superconductor vanishes, so that in this region the integral 
on the left-hand side is lower by ~ There must be an additional contribution 
to cr( to balance this deficiency, (b) Show that if cr'(u) < co g ) < cr' : (a> < o> g ), as is ob- 
served experimentally, then <r((w) can have a delta function contribution at o> = 0, 
and from the delta function there is a contribution cr"((o) ~ cr‘„ The delta 
function corresponds to infinite conductivity at zero frequency, (c) By elementary 
consideration of the classical motion of conduction electrons at very high frequen- 
cies, show that 


(CGS) cr'( to) do) — rme 1 /2m , 

J n 

a result found by Ferrell and Glover. 

5. Dielectric constant and the semiconductor energy gap. The effect on e"(a>) 
uf an energy gap o)„ in a semiconductor may be approximated very' roughly by 
substituting §<S(ti> — e>,) for 5(a>) in the response function (16); that is, we take 
e''(o>) = { 2 ttu e 2 lmo) ) tt3 ( u) — o)f). This is crude because it puts all the absorption at 
the gap frequency'. The factor 1/2 enters as soon as we move the delta function 
away from the origin, because the integral in the sum rule of Problem 2 starts at the 
origin. Show that the real part of the dielectric constant on this model is 

e’(w) = 1 + — m 2 ) , to 2 = 4ime a /rn . 

It follows that the static dielectric constant is e'(0) = 1 + lapori, widely used as a 
rule of thumb. 

6. Hagen-Ruhens relation for infrared reflectivity of metals. The complex refrac- 
tive index n + iK of a metal for wt 1 is given by 

(CGS) e(o)) = (n + iK) 2 = 1 + 4mcr a /o) , 

where c,, is the conductivity for static fields. We assume here that intraband 
currents are dominant; interband transitions are neglected. Using the result of 
Problem 3 for the reflection coefficient at normal incidence, show that 

(CGS) h “ 1 -(2 <a/mj- 0 ) la , 

provided that cr 0 §> a>. This is the Hagen-Rubens relation. For sodium at room 
temperature, cr 0 — 2.1 X 10 17 s^ 1 in CGS and t = 3.1 X 10 -14 s, as deduced from 



4S2 


t = o-tfn/ne 1 . Radiation of 10 fim has w = 1.88 X 10 14 s \ so that the Hagen- 
Ruhens result should apply: R = 0.976. The result calculated from experimental val- 
ues of n and K is 0.987. Hint: If a 0 > co, then n 2 — K 2 . This simplifies the algebra. 

*7. Davydov splitting of exciton lines. The Frenkel exciton band of Fig. 9 is doubled 
when there are two atoms A, B in a primitive cell. Extend the theory of Eqs. (25) to 
(29) to a linear crystal AB.AB.AB.AB. with transfer integrals T, between AB and T 2 
between B.A. Find an equation for the two bands as functions of the wavevector. 
The splitting between the bands at k = 0 is called the Davydov splitting. 


’This problem is somewhat difficult. 
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NOTATION: 

(CGS) 

(SI) 


a = p/£locaI ; 

& p/E local ’ 

6 cgs = e si ; 4'77 / ^ cgs = ^ S1 ; a SI = 47 re 0 a CG s • 


e 0 = 1074m: 2 ; 

D = E + 47 rP = eE = (1 + 4ttx)E ; 
D = €qE + F = €€oE = ( 14 - xUqE ; 


Figure 1 The permanent dipole moment of a molecule of water has the 
magnitude 1.9 X 10~ 18 esu-cm and is directed from the O 2 * ion toward 
the midpoint of the line connecting the H + ions. (To convert to SI units, 
multiply p by 5 X 10 11 .) 




Figure 2 Electrostatic potential and field components in CGS at position r, 8 for a dipole p 
directed along the 2 axis. For 8 = 0, we have E s = E y = 0 and E. = 2 p/r 3 ; for 8 = tt/2 we have 
E, = E y = 0 and E- = —p/r 3 . To convert to SI, replace p by p/4we 0 . (After E. M. Purcell.) 
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CHAPTER 16: DIELECTRICS AND FERROELECTRICS 


First we relate the applied electric field to the internal electric field in a 
dielectric crystal. The study of the electric field within dielectric matter arises 
when we ask: 

• What is the relation in the material between the dielectric polarization P 
and the macroscopic electric field E in the Maxwell equations? 

• What is the relation between the dielectric polarization and the local 
electric field which acts at the site of an atom in the lattice? The local field 
determines the dipole moment of the atom. 

Maxwell Equations 

(CGS) 

curl H = j + \ (E + 4ttP) ; 

1P _ 1 <3B 

curlE - - clF ; 

div E = 47rp ; 
div B = 0 ; 

Polarization 

The polarization P is defined as the dipole moment per unit 
volume, averaged over the volume of a cell. The total dipole moment is 
defined as 


(SI) 

curlH=j + |(e 0 E + P) ; 

curlE--f ; 
div e 0 E = p ; 
div B = 0 . 


P = 2q„r„ , 


( 1 ) 


where r„ is the position vector of the charge q n . The value of the sum will be 
independent of the origin chosen for the position vectors, provided that the 
system is neutral: Let r' = r„ + R; then p = r' = RXq„ + Xq„r„ = 2t/„r„. 
The dipole moment of a water molecule is shown in Fig. 1. 

The electric field at a point r from a dipole moment p is given by a stan- 
dard result of elementary electrostatics: 


(CGS) E(r) = 


3(p • r)r - r 2 p 



( 2 ) 


The lines of force of a dipole pointing along the z axis are shown in Fig. 2. 
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MACROSCOPIC ELECTRIC FIELD 


One contribution to the electric field inside a body is that of the applied 
electric field, defined as 


E 0 = field produced by fixed charges external to the body . 


(3) 


The other contribution to the electric field is the sum of the fields of all 
charges that constitute the body. If the body is neutral, the contribution to the 
average field may be expressed in terms of the sum of the fields of atomic 
dipoles. 

We define the average electric field E(r 0 ) as the average field over the 
volume of the crystal cell that contains the lattice point r 0 : 

E(r 0 ) = J dV e(r) , (4) 


where e(r) is the microscopic electric field at the point r. The field E is a 
much smoother quantity than the microscopic field e. We could well have 
written the dipole field (2) as e(r) because it is a microscopic unsinoothed 
field. 

We call E the macroscopic electric field. It is adequate for all problems 
in the electrodynamics of crystals provided that we know the connection be- 
tween E, the polarization P, and the current density j, and provided that the 
wavelengths of interest are long in comparison with the lattice spacing. 1 

To find the contribution of the polarization to the macroscopic field, we 
can simplify the sum over all the dipoles in the specimen. By a famous theo- 
rem of electrostatics 2 the macroscopic electric field caused by a uniform polar- 
ization is equal to the electric field in vacuum of a fictitious surface charge 


‘A detailed derivation of the Maxwell equations for the macroscopic: fields E and B, starting 
from tlie Maxwell equations in terms of the microscopic fields e and h, is given by E. M. Purcell, 
Electricity and magnetism, 2nd ed., McGraw-Hill, 1985. 

2 The electrostatic potential in CGS units of a dipole p is <r(r) = p • grad(l/r). For a volume 
distribution of polarization P we have 

<p(r) = J dV • grad ^ j , 

which by a vector identity becomes 

■p(r) = J dV f-p div P + div j 

if P is constant, then div P = 0 and by the Gauss theorem we have 

<p(r) = J d.S-p-= I dSj, 

where crdS is an element of charge on the surface of the body. This completes the proof. 
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Figure 3 (a) A uniformly polarized dielectric slab, with the polarization vector P normal to the 
plane of the slab, (b) A pair of uniformly charged parallel plates which give rise to the identical 
electric field as in (a). The upper plate has the surface charge density a = +P, and the lower 
plate has u = —P. 


density a = n • P on the surface of the body. Here n is the unit normal to the 
surface, drawn outward from the polarized matter. 

We apply the result to a thin dielectric slab (Fig. 3a) with a uniform vol- 
ume polarization P. The electric field Ej(r) produced by the polarization is 
equal to the field produced by the fictitious surface charge density cr = n ■ P 
on the surface of the slab. On the upper boundary the unit vector n is directed 
upward and on the lower boundary n is directed downward. The upper bound- 
ary bears the fictitious charge a = n • P = P per unit area, and the lower 
boundary bears — P per unit area. 

The electric field Ej due to these charges has a simple form at any point 
between the plates, but comfortably removed from their edges. By Gauss’s law 

(CGS) Ej = — 4 ttH = —4nP ; (SI) Ej = - = ~ . (4a) 

We add Ei to the applied field E 0 to obtain the total macroscopic field 
inside the slab, with z the unit vector normal to the plane of the slab: 

(CGS) E = E 0 + Ej = E 0 — 4ttPz ; (5) 

(SI) E = E 0 + E 1 = E 0 -^z . 

We define 


E] = field of the surface charge denisty n • P on the boundary . 


( 6 ) 


This field is smoothly varying in space inside and outside the body and satisfies 
the Maxwell equations as written for the macroscopic field E. The reason E x is 
a smooth function when viewed on an atomic scale is that we have 
replaced the discrete lattice of dipoles p ( with the smoothed polarization P. 
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Depolarization Field, 

If the polarization is uniform within the body, the only contributions to the 
macroscopic field are from E 0 and Ep 


E = E 0 + E x . 


(7) 


Here E 0 is the applied field and E! is the field due to the uniform polarization. 

The field Ei is called the depolarization field, for within the body it 
tends to oppose the applied field E 0 as in Fig. 4. Specimens in the shape of 
ellipsoids, a class that includes spheres, cylinders, and discs as limiting forms, 
have an advantageous property: a uniform polarization produces a uniform de- 
polarization field inside the body. This is a famous mathematical result demon- 
strated in classic texts on electricity and magnetism. 3 

If P x , P y , P z are the components of the polarization P referred to the principal 
axes of an ellipsoid, then the components of the depolarization field are written 


(CGS) 

£n = ~N X P X ; 


-N P ■ 

x y* y * 


-N Z P Z 

(SI) 

p 

£ ' Lt e 0 ’ 

II 

fcs? 

N P 

‘rs 

£n = 

. 

N Z P Z 

Co 


( 8 ) 


Here N x , N y , N. are the depolarization factors; their values depend on the 
ratios of the principal axes of the ellipsoid. The N’s are positive and satisfy the 
sum rule N x + N y + N. = 4n in CGS, and N x + N y + N, = 1 in SI. 

Values of N parallel to the figure axis of ellipsoids of revolution are plotted 
in Fig. 5. Additional cases have been calculated by Osborn 4 and by Stoner. In 
limiting cases N has the values: 


Shape 

Axis 

N 

(CGS) 

N 

(SI) 

Sphere 

any 

4tt/3 

1/3 

Thin slab 

normal 

477 

1 

Thin slab 

in plane 

0 

0 

Long circular cylinder 

longitudinal 

0 

0 

Long circular cylinder 

transverse 

27 r 

1/2 


We can reduce the depolarization field to zero in two ways, either by working 
parallel to the axis of a long fine specimen or by making an electrical connection 
between electrodes deposited on the opposite surfaces of a thin slab. 


3 R. Becker, Electromagnetic fields and interactions, Blaisdell, 1964, pp. 102-107. 

4 J. A. Osborn, Phys. Rev. 67, 351 (1945); E. C. Stoner, Philosophical Magazine 36, 803 (1945). 



16 Dielectrics and Ferroelectrics 


459 



Figure 4 The depolarization field E x is op- 
posite to P. The fictitious surface charges are 
indicated: the field of these charges is Ej 
within the ellipsoid. 



Figure 5 Depolarization factor N 
parallel to the figure axis of ellip- 
soids of revolution, as a function 
of the axial ratio c/a. 


A uniform applied field E 0 will induce uniform polarization in an ellipsoid. 
We introduce the dielectric susceptibility x such that the relations 

(CGS) P = *E ; (SI) P = eo*E , (9) 

connect the macroscopic field E inside the ellipsoid with the polarization P. 
Here * S i = 4ir* C GS- 

If E 0 is uniform and parallel to a principal axis of the ellipsoid, then 

(CGS) E = E 0 + E, = E 0 - NP ; (SI) E = E 0 -^~ , (10) 

by (8), whence 

(CGS) P = x(E 0 - NP) ; P = r ^-E 0 ; (11) 

(SI) P = *(eoE 0 - NP) ; P = E 0 . 

The value of the polarization depends on the depolarization factor N. 
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LOCAL ELECTRIC FIELD AT AN ATOM 

The value of the local electric field that acts at the site of an atom is signif- 
icantly different from the value of the macroscopic electric field. We can con- 
vince ourselves of this by consideration of the local field at a site with a cubic 
arrangement of neighbors 5 in a crystal of spherical shape. The macroscopic 
electric field in a sphere is 

(CGS) E = E 0 + E x = E 0 — P ; (12) 

(SI) E = E 0 + E x = E 0 - P , 

by (10). 

But consider the field that acts on the atom at the center of the sphere 
(this atom is not unrepresentative). If all dipoles are parallel to the 3 axis and 
have magnitude p, the z component of the field at the center due to all other 
dipoles is, from (2), 


(CGS) 


E 


dipole 






(13) 


In SI we replace p by p/4ire 0 . The x, y, z directions are equivalent because of 
the symmetry of the lattice and of the sphere; thus 


2 

i 





whence E dipole = 0. 

The correct local field is just equal to the applied field, E loca i = E 0 , for an 
atom site with a cubic environment in a spherical specimen. Thus the local 
field is not the same as the macroscopic average field E. 

We now develop an expression for the local field at a general lattice site, 
not necessarily of cubic symmetry. The local field at an atom is the sum of the 
electric field E 0 from external sources and of the field from the dipoles within 
the specimen. It is convenient to decompose the dipole field so that part of the 
summation over dipoles may be replaced by integration. 

We write 


Elocai — Eq + E t + E 2 + E 3 . 


(14) 


5 Atom sites in a cubic crystal do not necessarily have cubic symmetry: thus the O 2 sites in the 
barium titanate structure of Fig. 10 do not have a cubic environment. However, the Na + and Cl” 
sites in the NaCl structure and the Cs + and Cl” sites in the CsCl structure have cubic symmetry. 
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Figure 6 The internal electric field on an atom in a crystal is the sum of the external applied field 
E n and of the field due to the other atoms in the crystal. The standard method of summing the di- 
pole fields of the other atoms is first to sum individually over a moderate number of neighboring 
atoms inside an imaginary sphere concentric with the reference atom: this defines the field E 3 , 
which vanishes at a reference site with cubic symmetry. The atoms outside the sphere can be 
treated as a uniformly polarized dielectric. Their contribution to the field at the reference point is 
E 1 + E 2 , where E ; is the depolarization field associated with the outer boundary and E 2 is the 
field associated with the surface of the spherical cavity. 


Here 

E 0 = field produced by fixed charges external to the body; 

E 3 = depolarization field, from a surface charge density n • P on the outer 
surface of the specimen; 

E 2 = Lorentz cavity field: field from polarization charges on inside of a 
spherical cavity cut (as a mathematical fiction) out of the specimen with the 
reference atom as center, as in Fig. 6; E 3 + E 2 is the field due to uniform po- 
larization of the body in which a hole has been created; 

E 3 = field of atoms inside cavity. 

The contribution E 3 + E 2 + E 3 to the local field is the total field at one 
atom caused by the dipole moments of all the other atoms in the specimen: 


(CGS) 


e 2 + e 3 = 2 


3(p, • rjr,- - ifp, 


(15) 


and in SI we replace p, by p/47re 0 . 

Dipoles at distances greater than perhaps ten lattice constants from the 
reference site make a smoothly varying contribution to this sum, a contribu- 
tion which may be replaced by two surface integrals. One surface integral is 
taken over the outer surface of the ellipsoidal specimen and defines E 1; as in 
Eq. (6). The second surface integral defines E 2 and may be taken over 
any interior surface that is a suitable distance (say 50 A) from the reference 
site. We count in E 3 any dipoles not included in the volume bounded by 
the inner and outer surfaces. It is convenient to let the interior surface be 
spherical. 
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Figure 7 Calculation of the field in a spherical cavity in Charge on ring = 

a uniformly polarized medium. 27r a sin 0 • a dd • P cos 6 


Lorentz Field, E 2 

The field E 2 due to the polarization charges on the surface of the fictitious 
cavity was calculated by Lorentz. If 0 is the polar angle (Fig. 7) referred to the 
polarization direction, the surface charge density on the surface of the cavity is 
— P cos 6. The electric field at the center of the spherical cavity of radius a is 

(CGS) E 2 = f (a~ 2 )(2va sin 0){ad0){ P cos 0)(cos 0)=^- P ; (16) 

Jo 3 

(SI) E 2 = ^-P. 

Oto 

This is the negative of the depolarization field Ej in a polarized sphere, so that 
E! + E 2 = 0 for a sphere. 

Field of Dipoles Inside Cavity, E 3 

The field E 3 due to the dipoles within the spherical cavity is the only term 
that depends on the crystal structure. We showed for a reference site with 
cubic surroundings in a sphere that E 3 = 0 if all the atoms may be replaced by 
point dipoles parallel to each other. The total local field at a cubic site is, from 
(14) and (16), 

(CGS) E, ocal = E 0 + E 1 +^P = E + ^P ; (17) 

(SI) E w = E + ^P. 

This is the Lorentz relation: the field acting at an atom in a cubic site is the 
macroscopic field E of Eq. (7) plus 47rP/3 or P/3e 0 from the polarization of the 
other atoms in the specimen. Experimental data for cubic ionic crystals sup- 
port the Lorentz relation. 
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DIELECTRIC CONSTANT AND POLARIZABILITY 

The dielectric constant e of an isotropic or cubic medium relative to vac- 
uum is defined in terms of the macroscopic field E: 


(CGS) 

E + AttP , , . 
e - E ~ 1 + 4ttx ; 

(18) 

(SI) 

e 0 E + P , , 

«= , p = 1 +* • 

e 0 E 


Remember that y S | = 47ry CGS , by definition, but e SI = e CGS . 

The susceptibility (9) is related to the dielectric constant by 


(CGS) 

_ P _ e — 1 /OTN _ P _ , 

X E Air ’ (SI) x e^E 6 1 ‘ 

1 ' (19) 

In a noncubic crystal the dielectric response is described by the components 
of the susceptibility tensor or of the dielectric constant tensor: 

(CGS) 

^ XpLV^V > ^pLV ^Vl> * 

(20) 

(SI) 

V* X/AV^qE'V > ^/XV ^ fXV XfXV * 



The polarizability a of an atom is defined in terms of the local electric 
field at the atom: 

V = aE local . (21) 

where p is the dipole moment. This definition applies in CGS and in SI, but 
a SI = 47re 0 a CGS . The polarizability is an atomic property, but the dielectric 
constant will depend on the manner in which the atoms are assembled to form 
a crystal. For a nonspherical atom a will be a tensor. 

The polarization of a crystal may be expressed approximately as the prod- 
uct of the polarizabilities of the atoms times the local electric field: 

P = lN J p j = '2N J a j E loc (j) , ( 22 ) 

J j 

where Nj is the concentration and a ; the polarizability of atoms j, and E\ oc (j) is 
the local field at atom sites j. 

We want to relate the dielectric constant to the polarizabilities; the result 
will depend on the relation that holds between the macroscopic electric field 
and the local electric field. We give the derivation in CGS units and state the 
result in both systems of units. 
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If the local field is given by the Lorentz relation (17), then 


(CGS) 


P = (ZNja,) ( E + 



and we solve for P to find the susceptibility 


(CGS) 



2A 


4 TT 


^Njaj 


( 23 ) 


By definition e = 1 + 47ry in CGS; we may rearrange (23) to obtain 


(CGS) 


6-1 
6 + 2 





e-1 
6 + 2 



(24) 


the Clausius-Mossotti relation. This relates the dielectric constant to the 
electronic polarizability, but only for crystal structures for which the Lorentz 
local field (17) obtains. 


Electronic Polarizability 

The total polarizability may usually be separated into three parts: elec- 
tronic, ionic, and dipolar, as in Fig. 8. The electronic contribution arises from 
the displacement of the electron shell relative to a nucleus. The ionic contri- 
bution comes from the displacement of a charged ion with respect to other 
ions. The dipolar polarizability arises from molecules with a permanent elec- 
tric dipole moment that can change orientation in an applied electric field. 


Total polarizability (real part) 



Figure 8 Frequency dependence of the several contributions to the polarizability. 



16 Dielectrics and Ferroelectrics 


465 


In heterogeneous materials there is usually also an interfacial polarization 
arising from the accumulation of charge at structural interfaces. This is of little 
fundamental interest, but it is of considerable practical interest because com- 
mercial insulating materials are usually heterogeneous. 6 

The dielectric constant at optical frequencies arises almost entirely from 
the electronic polarizability. The dipolar and ionic contributions are small at 
high frequencies because of the inertia of the molecules and ions. In the opti- 
cal range (24) reduces to 

(CGS) ^ 2ALa,( electronic) ; (25) 

n + 2 o 

here we have used the relation n 2 = e, where n is the refractive index. 

By applying (25) to large numbers of crystals we determine in Table 1 em- 
pirical values of the electronic polarizabilities that are reasonably consistent 
with the observed values of the refractive index. The scheme is not entirely 
self-consistent, because the electronic polarizability of an ion depends 
somewhat on the environment in which it is placed. The negative ions are 
highly polarizable because they are large. 


Table 1 Electronic polarizabilities of atoms and ions, in 10 24 cm 3 





He 

Li + 

Be 24 

g3+ 

C 44 

Pauling 



0.201 

0.029 

0.008 

0.003 

0.0013 

JS 




0.029 





O 2 

F 

Ne 

Na + 

Mg 2+ 

ai 3+ 

Si 44 

Pauling 

3.88 

1.04 

0.390 

0.179 

0.094 

0.052 

0.0165 

JS-(TKS) 

(2.4) 

0.858 


0.290 





s 2 - 

cr 

Ar 

K + 

Ca 2+ 

Se 3+ 

Ti 44 

Pauling 

10.2 

3.66 

1.62 

0.83 

0.47 

0.286 

0.185 

JS-(TKS) 

(5.5) 

24)47 


1.133 

(1.1) 


(0.19) 


Se 2 ~ 

Br 

Kr 

Rb + 

Sr 24 

r + 

Zr 4+ 

Pauling 

10.5 

4.77 

2.46 

1.40 

0.86 

0.55 

0.37 

JS-(TKS) 

(7.) 

4.091 


1.679 

(1.6) 




Te 2 " 

r 

Xe 

Cs + 

Ba a+ 

La 3+ 

Ce 44 

Pauling 

14.0 

7.10 

3.99 

2.42 

1.55 

1.04 

0.73 

JS-(TKS) 

(9.) 

6.116 


2.743 

(2.5) 




Values from L. Pauling, Proc. R. Soc. London A114, 181 (1927); S. S. Jaswal and T. P. 
Sharma, J. Phys. Chein. Solids 34, 509 (1973); and J. Tessman, A Kahn, and W. Shockley, Phys. 
Rev. 92, 890 (1953). The TKS polarizabilities are at the frequency of the D lines of sodium. The 
values arc in CGS; to convert to SI, multiply by 9 X 10 5 . 

f, For references see D. E. Aspnes, Am. J. Phys. 50, 704 (1982). 
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Classical Theory of Electronic Polarizability. Ail electron bound har- 
monically to an atom will show resonance absorption at a frequency co Q = 
((3 im) m , where (3 is the force constant. The displacement x of the electron 
occasioned by the application of a field Ej oc is given by 

-eE l0C = /3x = rrmlx , (26) 

so that the static electronic polarizability is 

a(electronic) = p/E ](X - -&r/E loc = e^/rnoif, . (27) 

The electronic polarizability will depend on frequency, and it is shown in 
the following example that for frequency to 

(CGS) a(electronic) = ■ — ; (28) 

0)5 — or 

but in the visible region the frequency dependence (dispersion) is not usually 
very important in most transparent materials. 


EXAMPLE: Frequency dependence. Find the frequency dependence of the elec- 
tronic polarizability of an electron having the resonance frequency w 0 , treating the sys- 
tem as a simple harmonic oscillator. 

The equation of motion in the local electric field E] oc sin tot is 


mCr + ma&c — 

dr 


<?E| oc sin cot 


so that, for x = x 0 sin 


m(-to 2 + <«>„)*„ = ~eE ]oc . 


The dipole moment has the amplitude 

Po ~ ~ ex o 


from which (28) follows. 


m(co o — co 2 ) 


In quantum theory' the expression corresponding to (28) is 

2 f- 

(CGS) a(electronic) = S 2 ' 3 2 , (29) 

j Wy ~ 0) 

where fy is called the oscillator strength of the electric dipole transition be- 
tween the atomic states i and j. Near a transition the polarizability changes 
sign (Fig. 8). 
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STRUCTURAL PHASE TRANSITIONS 

It is not uncommon for crystals to transform from one crystal structure to 
another as the temperature or pressure is varied. The stable structure A at 
absolute zero generally has the lowest accessible internal energy of all the pos- 
sible structures. Even this selection of a structure A can be varied with appli- 
cation of pressure, because a low atomic volume will favor closest-packed or 
even metallic structures. Hydrogen and xenon, for example, become metallic 
under extreme pressure. 

Some other structure B may have a softer or lower frequency phonon 
spectrum than A. As the temperature is increased the phonons in B will be 
more highly excited (higher thermal average occupancies) than the phonons in 
A. Because the entropy increases with the occupancy, the entropy of B will be- 
come higher than the entropy of A as the temperature is increased. 

It is thereby possible for the stable structure to transform from A to B as 
the temperature is increased. The stable structure at a temperature T is deter- 
mined by the minimum of the free energy F = U ~ TS. There will be a transi- 
tion from A to B if a temperature T 0 exists (below the melting point) such that 
F a (T c ) = F b (T c ). 

Often several structures have nearly the same internal energy at absolute 
zero. The phonon dispersion relations for the structures may, however, be rather 
different. The phonon energies are sensitive to the number and arrangement of 
nearby atoms; these are the quantities that change as the structure is changed. 

Some structural phase transitions have only small effects on the macro- 
scopic physical properties of the material. However, if the transition is influ- 
enced by an applied stress, the crystal may yield mechanically quite easily near 
the transition temperature because the relative proportions in the two phases 
will change under stress. Some other structural phase transitions may have 
spectacular effects on the macroscopic electrical properties. 

Ferroelectric transitions are a subgroup of structural phase transitions, a 
subgroup marked by the appearance of a spontaneous dielectric polarization in 
the crystal. Ferroelectrics are of theoretical and technical interest because 
they often have unusually high and unusually temperature-dependent values 
of the dielectric constant, the piezoelectric effect, the pyroelectric effect, and 
electro-optical effects, including optical frequency doubling. 


FERROELECTRIC CRYSTALS 

A ferroelectric crystal exhibits an electric dipole moment even in the ab- 
sence of an external electric field. In the ferroelectric state the center of posi- 
tive charge of the crystal does not coincide with the center of negative charge. 

The plot of polarization versus electric field for the ferroelectric state 
shows a hysteresis loop. A crystal in a normal dielectric state usually does not 
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Figure 9 The temperature variation of (a) the dielectric constant e, (b) the pyroelectric coeffi- 
cient dP/dT, and (c) the specific heat c f , of PbTi0 3 . (After Remeika and Class.) 


show significant hysteresis when the electric field is increased and then re- 
versed, both slowly. 

Ferroelectricity usually disappears above a certain temperature called the 
transition temperature. Above the transition the crystal is said to be in a para- 
electric state. The term paraelectric suggests an analogy with paramagnetism: 
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there is usually a rapid drop in the dielectric constant as the temperature 
increases. 

In some crystals the ferroelectric dipole moment is not changed by an 
electric field of the maximum intensity' which it is possible to apply before 
causing electrical breakdown. In these crystals we are often able to observe a 
change in the spontaneous moment when the temperature is changed (Fig. 9). 
Such crystals are called pyroelectric. Lithium niobate, LiNb0 3 , is pyroelec- 
tric at room temperature. It has a high transition temperature ( T c = 1480 K) 
and a high saturation polarization (50 /rC/cm 2 ). It can be 'poled/’ which 
means given a remanent polarization, by an electric field applied over 1400 K. 

Classification of Ferroelectric Crystals 

We list in Tabic 2 sonic of the crystals commonly considered to be ferroelec- 
tric, along with the transition temperature or Curie point T c at which the crystal 
changes from the low-temperature polarized state to the high-temperature 
unpolarized state. Thermal motion tends to destroy the ferroelectric order. 
Some ferroelectric crystals have no Curie point because they melt before leaving 
the ferroelectric phase. The table also includes values of the spontaneous polar- 
ization P s . Ferroelectric crystals may be classified into two main groups, order- 
disorder or displacive. 

One may define the character of the transition in terms of the dynamics of 
the lowest frequency (“soft”) optical phonon modes. If a soft mode can propa- 
gate in the crystal at the transition, then the transition is displacive. If the soft 
mode is only diffusive (non-propagating) there is really not a phonon at all, 


Table 2 Ferroelectric crystals 


To obtain the spontaneous polarization P v 
value given in /xC cm -2 by 3 X 10 s . 

in the CGS unit of 

esu cm 2 , multiply 

the 



G> in K 

P s , in /rC cm 2 

, atTK 

KDP type 

kh 2 po 4 

123 

4.75 

196] 


kd 2 po 4 

213 

4.83 

[180] 


RbtI 2 P0 4 

147 

5.6 

[90] 


KH 2 As0 4 

97 

5.0 

[78] 


GeTe 

670 

— 

— 

TGS type 

Tri-glycine sulfate 

322 

2.8 

[29] 


Tri-glycine selenate 

295 

3.2 

[283] 

Perovskites 

BaTiO :$ 

408 

26.0 

[296] 


KNbO .5 

708 

30.0 

[523] 


PbTiOj 

765 

>50 

[296] 


T,iTa0 3 

938 

50 



LiNb0 3 

1480 

71 

[296] 
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but is only a large amplitude hopping motion between the wells of the order- 
disorder system. Many ferroelectrics have soft modes that fall between these 
two extremes. 

The order-disorder class of ferroelectrics includes crystals with hydrogen 
bonds in which the motion of the protons is related to the ferroelectric 
properties, as in potassium dihydrogen phosphate (KH 2 P0 4 ) and isomorphous 
salts. The substitution of deuterons for protons nearly doubles T c , although the 
fractional change in the molecular weight of the compound is less than 2 
percent: 

KH 2 P0 4 KD 2 P0 4 KH 2 As0 4 KD,As 0 4 
Curie temperature 123 K 213 K 97 K 162 K 

This extraordinarily large isotope shift is believed to be a quantum effect in- 
volving the mass-dependence of the de Broglie wavelength. Neutron diffraction 
data show that above the Curie temperature the proton distribution along the 
hydrogen bond is symmetrically elongated. Below the Curie temperature the 
distribution is more concentrated and asymmetric with respect to neighboring 
ions, so that one end of the hydrogen bond is preferred by the proton over the 
other end, giving a polarization. 

The displacive class of ferroelectrics includes ionic crystal structures 
closely related to the perovskite and ilmenite structures. The simplest ferro- 
electric crystal is GeTe with the sodium chloride structure. We shall devote 
ourselves primarily to crystals with the perovskite structure, Fig. 10. 

Consider the order of magnitude of the ferroelectric effects in barium 
titanate: the observed saturation polarization P s at room temperature (Fig. 11) 




Figure 10 (a) The crystal structure of barium titanate. The prototype crystal is calcium titanate 
(perovskite). The structure is cubic, with Ba 2+ ions at the cube corners, 0 2 ~ ions at the face cen- 
ters, and a Ti 4+ ion at the body center, (b) Below the Curie temperature the structure is slightly 
deformed, with Ba 2+ and Ti 4+ ions displaced relative to the 0 2 “ ions, thereby developing a dipole 
moment. The upper and lower oxygen ions may move downward slightly. 
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Temperature (°C) 

Figure 11 Spontaneous polarization projected on cube edge of barium titanate, as a function of 
temperature. (After W. J. Merz.) 


is 8 X 10 4 esu cm 2 . The volume of a cell is (4 X 10 8 ) 3 = 64 X 10 24 cm 3 , so 
that the dipole moment of a cell is 

(CGS) p = (8 X 10 4 esu cm~ 2 )(64 X 10 -24 cm -3 ) s5X 10~ 18 esu cm ; 

(SI) p = (3 X 10“ 1 C m“ z )(64 X 10“ 30 m 3 ) = 2 X 10“ 29 C m . 

If the positive ions Ba 2+ and Ti 4+ were moved by 8 = 0.1 A with respect to the 
negative O 2- ions, the dipole moment of a cell would be 6 e8 = 3 X 10“ 18 esu 
cm. In LiNb0 3 the displacements are considerably larger, being 0.9 A and 0.5 A 
for the lithium and niobum ions respectively, giving the larger P s . 


DISPLACIVE TRANSITIONS 

Two viewpoints contribute to an understanding of a ferroelectric displacive 
transition and by extension to displacive transitions in general. We may speak of 
a polarization catastrophe in which for some critical condition the polarization 
or some Fourier component of the polarization becomes very large. Equally, we 
may speak of the condensation of a transverse optical phonon. Here the word 
condensation is to be understood in the Bose-Einstein sense (TP, p. 199) of a 
time-independent displacement of finite amplitude. This can occur when the 
corresponding TO phonon frequency vanishes at some point in the Brillouin 
zone. LO phonons always have higher frequencies than the TO phonons of the 
same wavevector, so we are not concerned with LO phonon condensation. 
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In a polarization catastrophe the local electric field caused by the ionic 
displacement is larger than the elastic restoring force, thereby giving an asym- 
metrical shift in the positions of the ions. Higher order restoring forces will 
limit the shift to a finite displacement. 

The occurrence of ferroelectricity (and antiferroelectricity) in many 
perovskite-structure crystals suggests that this structure is favorably disposed 
to a displacive transition. Local field calculations make clear the reason for the 
favored position of this structure: the O 2 ” ions do not have cubic surround- 
ings, and the local field factors turn out to be unusually large. 

We give first the simple form of the catastrophe theory, supposing that the 
local field at all atoms is equal to E + 47 tP/ 3 in CGS or E + P/3e 0 in SI. The 
theory given now leads to a second-order transition; the physical ideas can be car- 
ried over to a first-order transition. In a second-order transition there is no latent 
heat; the order parameter (in this instance, the polarization) is not discontinuous 
at the transition temperature. In a first-order transition there is a latent heat; the 
order parameter changes discontinuously at the transition temperature. 

We rewrite (24) for the dielectric constant in the form 


(CGS) 


e = 


1 + j^A 

1 


(30) 


where a { is the electronic plus ionic polarizability of an ion of type i and N, is 
the number of ions i per unit volume. The dielectric constant becomes infinite 
and permits a finite polarization in zero applied field when 

(CGS) 2A 7 ,a, - 3/4 tt . (31) 


This is the condition for a polarization catastrophe. 

The value of e in (30) is sensitive to small departures of X V/r, from the 
critical value 3/477. If we write 

(CGS) (4ir/3)2iV,a < = I - 3s , (32) 

where s *£ 1, the dielectric constant in (30) becomes 

€ — 1/s . (33) 

Suppose near the critical temperature s varies linearly with temperature: 

s = (T — TM , (34) 

where £ is a constant. Such a variation of s or X 'N i a i might come from normal 
thermal expansion of the lattice. The dielectric constant has the form 


g 

T-T, 


(35) 


close to the observed temperature variation in the paraelectric state, Fig. 12. 
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Figure 12 Dielectric constant versus 1/(T — Tj in the paraelectric state (T > T c ) of perovskites, 
after G. Rupprecht and R. O. Bell, 

Soft Optical Phonons 

The Lyddane-Sachs-Teller relation (Chapter 14) is 

= e(<»)/e( 0) . (36) 

The static dielectric constant increases when the transverse optical phonon fre- 
quency decreases. When the static dielectric constant e(0) has a high value, 
such as 100 to 10,000, we find that o> T has a low value. 

When co T = 0 the crystal is unstable and e(0) is infinite because there is no 
effective restoring force. The ferroelectric BaTi0 3 at 24°C has a TO mode at 
12 cm -1 , a low frequency for an optical mode. 

If the transition to a ferroelectric state is first order, we do not find a>-, = 0 
or e(0) = 00 at the transition. The LST relation suggests only that e(0) extrapo- 
lates to a singularity at a temperature T 0 below T c . 

The association of a high static dielectric constant with a low-frequency 
optical mode is supported by experiments on strontium titanate, SrTi0 3 . 
According to the LST relation, if the reciprocal of the static dielectric constant 
has a temperature dependence l/e(0) °c (T — T 0 ), then the square of the 
optical mode frequency will have a similar temperature dependence: 
o)j « (T — T 0 ), if w, is independent of temperature. The result for w* is 
very well confirmed by Fig. 13. Measurements of cd t versus T for another 
ferroelectric crystal, SbSI, are shown in Fig. 14. 
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V2 





Temperature, K 

Figure 13 Plot of the square of the frequency of the zero wavevector transverse optical mode 
against temperature, for SrTiO.,, as observed in neutron diffraction experiments by Cowley. The bro- 
ken line is the reciprocal of the dielectric constant from the measurements of Mitsui and Westphal. 



Figure 14 Decrease of a transverse phonon frequency as the Curie temperature is approached 
from below, in the ferroelectric crystal antimony sulphoiodide, SbSI. (After Raman scattering 
experiments by C. H. Perry and D. K. Agrawal.) 


Landau Theory of the Phase Transition 

A ferroelectric with a first-order phase transition between the ferroelec- 
tric and the paraelectric state is distinguished by a discontinuous change of the 
saturation polarization at the transition temperature. The transition between 
the normal and superconducting states is a second-order transition, as is the 
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transition between the ferromagnetic and paramagnetic states. In these transi- 
tions the degree of order goes to zero without a discontinuous change as the 
temperature is increased. 

We can obtain a consistent formal thermodvnamie theory of the behavior 
of a ferroelectric crystal by considering the form of the expansion of the en- 
ergy as a function of the polarization P. We assume that the Landau 1 free en- 
ergy density F in one dimension may be expanded formally as 

F(P;T,E) = -EP + g 0 + §g 2 P 2 + + IgeJ* + ' ' • , (37) 

where the coefficients g n depend on the temperature. 

The series does not contain terms in odd powers of P if the unpolarized 
crystal has a center of inversion symmetry, but crystals are known in which odd 
powers are important. Power series expansions of the free energy do not al- 
ways exist, for nonanalytic terms are known to occur, especially when very near 
a transition. For example, the transition in K 1 1 2 PO , appears to have a logarith- 
mic singularity in the heat capacity’ at the transition, which is not classifiable as 
cither first or second order. 

The value of P in thermal equilibrium is given by the minimum of F as a 
function of P; the value of F at this minimum defines the Helmholtz free en- 
ergy F(T,E). The equilibrium polarization in an applied electric field E satis- 
fies the extremum condition 

|| = 0=-E+g 2 r + g 4 P 3 + g 6 P 3 + -" ■ (38) 

In this section we assume that the specimen is a long rod with the external ap- 
plied field E parallel to the long axis. 

To obtain a ferroelectric state we must suppose that the coefficient of the 
term in P 2 in (37) passes through zero at some temperature 7’ 0 : 

g 2 =yC r-T D ), (39) 

where y is taken as a positive constant and T y may be equal to or lower than 
the transition temperature. A small positive value of g 2 means that the lattice is 
“soft” and is close to instability. A negative value of g 2 means that the unpolar- 
ized lattice is unstable. The variation of g 2 with temperature is accounted for 
by thermal expansion and other effects of anharmonic lattice interactions. 

Second-Order Transition 

If g 4 in (37) is positive, nothing new is added by the term in g 6 , and this 
mav then he neglected. The polarization for zero applied electric field is found 
from (38): 

y(T-T 0 )P s +g 4 P^ = 0 , (40) 


In TP see pp. 69 and 298 for a discussion of the Landau function. 
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Figure 15 Spontaneous polarization versus temperature, for a second-order phase transition. 



Figure 16 Temperature variation of the polar-axis static dielectric constant of LiTaO a . (After 
Glass.) 


so that either P, = 0 or Pf = (yfg 4 )(T 0 - T). For T > T 0 the only real root of (40) 
is at P s = 0, because y and g 4 are positive. Thus T 0 is the Curie temperature. For 
T < T 0 the minimum of the Landau free energy in zero applied field is at 

\r s \ = (y/gd 1/2 (To-Tr , (4i) 

as plotted in Fig. 15. The phase transition is a second-order transition because 
the polarization goes continuously to zero at the transition temperature. The 
transition in LiTaO^ is an example (Fig. 16) of a second-order transition. 
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First-Order Transition 


The transition is first order if g 4 in (37) is negative. We must now retain g 6 
and take it positive in order to restrain F from going to minus infinity (Fig- 17). 
The equilibrium condition for E = 0 is given by (38): 

y(T - T 0 )P S - l&jP? + g 6 ff = 0 , (42) 


so that either P s = 0 or 


y(T — T 0 ) -l&lf? +ge^ = 0 . 


(43) 


At the transition temperature T c the free energies of the paraelectric and 
ferroelectric phases will be equal. That is, the value of F for P s = 0 will be equal 
to the value of F at the minimum given by (43). In Fig. 18 wc show the charac- 
teristic variation with temperature of P s for a first-order phase transition; 



Figure 17 Landau free energy function versus (polarization) 2 in a first-order transition, at repre- 
sentative temperatures. At T r the Landau function has equal minima at P = 0 and at a finite P as 
shown. For T below T c the absolute minimum is at larger values of P; as T passes through T, there 
is a discontinuous change in the position of the absolute minimum. The arrows mark the minima. 



Figure 18 Calculated values 
of the spontaneous polar- 
ization as a function of tem- 
perature, with parameters as 
for barium titanate. (After 
W. Cochran.) 
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contrast this with the variation shown in Fig. 15 for a second-order phase tran- 
sition. The transition in BaTiO^ is first order. 

The dielectric constant is calculated from the equilibrium polarization in 
an applied electric field E and is found from (38). In equilibrium at tempera- 
tures over the transition, the terms in P 4 and P 6 may be neglected; thus E = 
y(T - T 0 )P, or 

(CGS) e(T > T c ) = 1 + 4t tP/E = 1 + 47r/y(T - T 0 ) , (44) 

of the form of (36). The result applies whether the transition is of the first or 
second order, but if second order we have T a — T c ; if first order, then To < f r.- 
Equation (39) defines T 0 , but T c is the transition temperature. 
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Figure 19 Schematic representation of fundamental types of structural phase transitions from a 
centrosym metric prototype. (After Lines and Glass.) 
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Antiferroelectricity 

A ferroelectric displacement is not the only type of instability that may 
develop in a dielectric crystal. Other deformations occur, as in Fig. 19. These 
deformations, even if they do not give a spontaneous polarization, may be ac- 
companied by changes in the dielectric constant. One type of deformation is 
called antiferroelectric and has neighboring lines of ions displaced in oppo- 
site senses. The perovskite structure appears to be susceptible to many types of 
deformation, often with little difference in energy between them. The phase 
diagrams of mixed perovskite systems, such as the PbZr0 3 -PbTi0 3 system, 
show transitions between para-, ferro-, and antiferroelectric states (Fig. 20). 
Several crystals believed to have an ordered nonpolar state are listed in Table 3. 

Ferroelectric Domains 

Consider a ferroelectric crystal (such as barium titanate in the tetragonal 
phase) in which the spontaneous polarization may be either up or down the c- 
axis of the crystal. A ferroelectric crystal generally consists of regions called 
domains within each of which the polarization is in the same direction, hut in 
adjacent domains the polarization is in different directions. In Fig. 21 the po- 
larization is in opposite directions. The net polarization depends on the differ- 
ence in the volumes of the upward- and downward- directed domains. The 



PbZrOs Mole percent PbTiOj PbTi0 3 

Figure 20 Ferroelectric F, antiferroelectric A, and paraelectrie P phases of the lead zirconate-lead 
titanate solid solution system. The subscript T denotes a tetragonal phase; C a cubic phase; R a 
rhombohedral phase, of which there are high-temperature (HT) and low-temperature (LT) forms. 
Near the rhombohedral-tetragonal phase boundaries one finds very high piezoelectric coupling 
coefficients. (After Jaffe.) 
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Table 3 Antiferroelectric crystals 


Crystal 

Transition temperature to 
antiferroelectric state, in K 

wo 3 

1010 

NaNb0 3 

793,911 

PbZr0 3 

506 

PbHf0 3 

488 

nh 4 h 2 po 4 

148 

nd 4 d 2 po 4 

242 

NH 4 H 2 As0 4 

216 

ND 4 D 2 As0 4 

304 

(NH 4 ) 2 H 3 io 6 

254 


From a compilation by Walter J. Merz. 
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Figure 21 (a) Schematic drawing of atomic displacements on either side of a boundary between 
domains polarized in opposite directions in a ferroelectric crystal; (b) view of a domain structure, 
showing 180° boundaries between domains polarized in opposite directions. 


crystal as a whole will appear to be unpolarized, as measured by the charge on 
electrodes covering the ends, when the volumes of domains in opposite senses 
are equal. The total dipole moment of the crystal may be changed by the 
movement of the walls between domains or by the nucleation of new domains. 

Figure 22 is a series of photomicrographs of a single crystal of barium 
titanate in an electric field normal to the plane of the photographs and parallel 
to the tetragonal axis. The closed curves are boundaries between domains 
polarized into and out of the plane of the photographs. The domain bound- 
aries change size and shape when the intensity of the electric field is altered. 
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550 Y/cni 705 V/cm 980 Worn 

0.01 cm 


Figure 22 Ferroelectric domains on the face of a single crystal of barium titanate. The face is 
normal to the tetragonal or c axis. The net polarization of the crystal as judged by domain volumes 
is increased markedly as the electric field intensity parallel to the axis is increased from 550 
volts/cm to 980 V/cm. The domain boundaries are made visible by etching the crystal in a weak 
acid solution. (R. C. Miller.) 


Piezoelectricity 

All crystals in a ferroelectric state are also piezoelectric: a stress Z applied 
to the crystal will change the electric polarization (Fig. 23). Similarly, an elec- 
tric field E applied to the crystal will cause the crystal to become strained. In 
schematic one-dimensional notation, the piezoelectric equations are 

(CGS) P = Zd + Ex-, e = Zs + Ed , (45) 

where P is the polarization, Z the stress, d the piezoelectric strain constant, 
E the electric field, \ the dielectric susceptibility, e the elastic strain, and s the 
elastic compliance constant. To obtain (45) in SI, replace x by CqX- These rela- 
tions exhibit the development of polarization by an applied stress and the de- 
velopment of elastic strain by an applied electric field. 

A crystal may be piezoelectric without being ferroelectric: a schematic ex- 
ample of such a structure is given in Fig. 24. Quartz is piezoelectric, but not 
ferroelectric; barium titanate is both. For order of magnitude, in quartz d ~ 
10~‘ cm/statvolt and in barium titanate d ~ 10” 5 cm/statvolt. The general defi- 
nition of the piezoelectric strain constants is 

d, k = (deddEJz , (46) 

where i = x, y, z and k = xx, yy, zz, yz, zx, xy. To convert to cm/stat-V from 
values of d ik given in m/V, multiply by 3 X 10 4 . 

The lead zirconate-lead titanate system (called the PZT system), Fig. 20, 
is widely used in polycrystalline (ceramic) form with compositions of very high 
piezoelectric coupling. The synthetic polymer polyvinylidenfluoride (PVF 2 ) is 
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(a) (b) 


Figure 23 (a) Unstressed ferroelectric crystal and (b) stressed ferroelectric crystal. The stress 
changes the polarization by AP, the induced piezoelectric polarization. 



Figure 24 (a) The unstressed crystal has a threefold symmetry axis. The arrows represent dipole 
moments; each set of three arrows represents a planar group of ions denoted by A^B 3- , with a 
B 3- ion at each vertex. The sum of the three dipole moments at each vertex is zero, (b) The crystal 
when stressed develops a polarization in the direction indicated. The sum of the dipole moments 
about each vertex is no longer zero. 


five times more strongly piezoelectric than crystalline quartz. Thin stretched 
films of PVF 2 are flexible and as ultrasonic transducers are applied in medicine 
to monitor blood pressure and respiration. 


SUMMARY 

(In CGS Units) 

• The electric field averaged over the volume of the specimen defines the 
macroscopic electric field E of the Maxwell equations. 

• The electric field that acts at the site r, of an atom j is the local electric field, 
E| oc . It is a sum over all charges, grouped in terms as Ei oe (r / ) = E 0 + E! + 
E 2 + E 3 (r,), where only E 3 varies rapidly within a cell. Here: 

E 0 = external electric field; 

E 3 = depolarization field associated with the boundary of the specimen; 

E 2 = field from polarization outside a sphere centered about ry, 

E 3 (ij) = field at ij due to all atoms inside the sphere. 
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• The macroscopic field E of the Maxwell equations is equal to E 0 + E x , 
which, in general, is not equal to E] oc (rj). 

• The depolarization field in an ellipsoid is E lfl = — where N^, is the 

depolarization tensor; the polarization P is the dipole moment per unit vol- 
ume. In a sphere N = 47r/3. 

• The Lorentz field is E 2 = 47rP/3. 

• The polarizability a of an atom is defined in terms of the local electric field 
as p = aE l0C . 

• The dielectric susceptibility x and dielectric constant e are defined in terms 
of the macroscopic electric field E as D = E + 4irP = eE = (1 + 47r^)E, 
or x = P/E- In SI, we have x = P/^oE. 

• An atom at a site with cubic symmetry has E loc = E + (47r/3)P and satisfies 
the Clausius-Mossotti relation (24). 


Problems 

1. Polarizability of atomic hydrogen. Consider a semiclassical model of the ground 
state of the hydrogen atom in an electric field normal to the plane of the orbit 
(Fig. 25), and show that for this model a = a\, where a H is the radius of the un- 
perturbed orbit. Note: If the applied field is in the x direction, then the x compo- 
nent of the field of the nucleus at the displaced position of the electron orbit must 
be equal to the applied field. The correct quantum-mechanical result is larger than 
this by the factor |. (We are speaking of a 0 in the expansion a = a 0 + a Y E +•••.) 
We assume x < a H . One can also calculate a x on this model. 



Figure 25 An electron in a circular orbit of radius a H 
is displaced a distance x on application of an electric 
field E in the —x direction. The force on the electron 
due to the nucleus is e 2 /aj i in CGS or e i IAire (t a 2 l in SI. 
The problem assumes x < a H . 
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2. Polarizability of conducting sphere. Show that the polarizability of a conduct- 
ing metallic sphere of radius a is a = a 3 . This result is most easily obtained by not- 
ing that E = 0 inside the sphere and then using the depolarization factor 47r/3 for a 
sphere (Fig. 26). The result gives values of a of the order of magnitude of the ob- 
served polarizabilities of atoms. A lattice of N conducting spheres per unit volume 
has dielectric constant e = 1 + 4t rfVa 3 , for Na 3 1. The suggested proportionality 
of a to the cube of the ionic radius is satisfied quite well for alkali and halogen 
ions. To do the problem in SI, use 5 as the depolarization factor. 

3. Effect of air gap. Discuss the effect of an air gap (Fig. 27) between capacitor 
plates and dielectric on the measurement of high dielectric constants. What is the 
highest apparent dielectric constant possible if the air gap thickness is 10 -3 of the 
total thickness? The presence of air gaps can seriously distort the measurement of 
high dielectric constants. 

4. Interfacial polarization. Show that a parallel-plate capacitor made up of two par- 
allel layers of material — one layer with dielectric constant e, zero conductivity, and 
thickness d, and the other layer with e = 0 for convenience, finite conductivity cr, 


Figure 26 The total field inside a conducting sphere is zero. If 
a field E 0 is applied externally, then the field Ej due to surface 
charges on the sphere must just cancel E 0 , so that E 0 + E t = 0 
within the sphere. But Ej can be simulated by the depolariza- 
tion field — 4ttP/3 of a uniformly polarized sphere of polariza- 
tion P. Relate P to E 0 and calculate the dipole moment p of the 
sphere. In SI the depolarization field is — P/3e 0 . 



Figure 27 An air gap of thickness qd is in series 
in a capacitor with a dielectric slab of thickness d. 
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and thickness qd — behaves as if the space between the condenser plates were 
filled with a homogeneous dielectric with dielectric constant 

_ e(l + g) 

1 — {ieoxjl'liTcr) 

where (o is the angular frequency. Values of e„.-f as high as 10 4 or 10 5 caused largely 
by this Maxwell- Wagner mechanism are sometimes found, but the high values are 
always accompanied by large ac losses. 

5. Polarization of sphere. A sphere of dielectric constant e is placed in a uniform ex- 
ternal electric field E 0 . (a) What is the volume average electric field E in the sphere? 

(b) Show that the polarization in the sphere is P = yE 0 /j 1 + ;4iry/3) ,, where \ ~ 
(e — l)/4ir. Hint: You do not need to calculate E loc in this problem; in fact it is con- 
fusing to do so, because e and x are defined so that P — \E. We require E 0 to be un- 
changed by insertion of the sphere. We can produce a fixed E 0 by placing positive 
charges on one thin plate of an insulator and negative charges on an opposite plate. If 
the plates are always far from the sphere, the field of the plates will remain un- 
changed when the sphere is inserted between them. The results above are in CGS. 

6. Ferroelectric criterion for atoms. Consider a system of two neutral atoms sepa- 
rated by a fixed distance a, each atom having a polarizability cr. Find the relation 
between a and a for such a system to be ferroelectric. Hint: The dipolar field is 
strongest along the axis of the dipole. 

7. Saturation polarisation at Curie point. In a first-order transition the equilibrium 
condition (43) with T set equal to T,. gives one equation for the polarization P S (T C ). 
A further condition at the Curie point is that F(P„ T c ) = F( 0, T c ). (a) Combining 
these two conditions, show that P*(T C ) = 3ig 4 !/4g 6 . (b) Using this result, show that 
T c = T o + 3g|/16yg 6 . 

8. Dielectric constant below transition temperature. In terms of the parameters 
in the Landau free energy expansion, show that for a second-order phase transition 
the dielectric constant below the transition temperature is 

e = l + 4ir&P/E = 1 + 2v/y(T, - T) . 

This result may he compared with (44) above the transition. 

9. Soft modes and lattice transformations. Sketch a monatomic linear lattice of 
lattice constant a. (a) Add to each of six atoms a vector to indicate the direction of 
the displacement at a given time caused by a longitudinal phonon with wavevector 
at the zone boundary, (b) Sketch the crystal structure that results if this zone 
boundary’ phonon becomes unstable (a> —¥ 0) as the crystal is cooled through T, . 

(c) Sketch on one graph the essential aspects of the longitudinal phonon dispersion 
relation for the monatomic lattice at T well above T c and at T = T, : . Add to the 
graph the same information for phonons in the new structure at T well below T, : . 

10. Ferroelectric linear array. Consider a line of atoms of polarizability a and sepa- 
ration a. Show that the array can polarize spontaneously if a ^ a 3 /4X» 3 , where 
the sum is over all positive integers and is given in tables as 1.202. . . . 
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CHAPTER 17; SURFACE AND INTERFACE PHYSICS 


Reconstruction and Relaxation 

The surface of a crystalline solid in vacuum is generally defined as the 
few, approximately three, outermost atomic layers of the solid that differ sig- 
nificantly from the bulk. The surface may be entirely clean or it may have for- 
eign atoms deposited on it or incorporated in it. The bulk of the crystal is 
called the substrate. 

If the surface is clean the top layer may be either reconstructed or, 
sometimes, unreconstructed. In unreconstnicted surfaces the atomic arrange- 
ment is in registry with that of the hulk except for an interlayer spacing change 
(called multilayer relaxation) at the top surface. 

The shrinking of the interlayer distance between the first and second layer 
of atoms with respect to subsequent layers in the bulk is a rather dominant 
phenomenon. The surface may be thought of as an intermediate between a di- 
atomic molecule and the bulk structure. Because the interatomic distances in 
diatomic molecules are much smaller than in the bulk, there is a rationale for 
the surface relaxation. This may be contrasted with reconstruction where the 
relaxation of atoms yields new surface primitive cells. In relaxation the atoms 
maintain their structure in the surface plane as it was (according to the projec- 
tion of the bulk cell on the surface); only their distance from the bulk changes. 

Sometimes in metals, but most often in nonmetals, the atoms in the 
surface layer form superstructures in which the atoms in the layer are not in 
registry with the atoms in corresponding layers in the substrate. This is surface 
reconstruction; it can be a consequence of a rearrangement of broken covalent 
or ionic bonds at the surface. Under such conditions the atoms at the surface 
bunch into rows with alternately larger and smaller spacings than in the bulk. 
That is, for some crystals held together by valence bonds, creation of a surface 
w’ould leave unsaturated bonds dangling into space (Fig. 1). The energy may 
then be lowered if neighboring atoms approach each other and form bonds 
with their otherwise unused valence electrons. Atomic displacements can be 
as large as 0.5 A. 

Reconstruction does not necessarily require formation of a superstruc- 
ture. For example, on GaAs (110) surfaces a rotation of the Ga-As bond leaves 
the point group intact. The driving force is electron transfer from Ga to As, 
which fills the dangling bonds on As and depletes them on Ga. 

Surfaces of planes nominally of high indices may be built up of low index 
planes separated by steps one (or two) atoms in height. Such terrace-step 
arrangements are important in evaporation and desorption because the attach- 
ment energy of atoms is often low at the steps and at kinks in the steps. The 
chemical activity of such sites may be high. The presence of periodic arrays of 
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steps may he detected by double and triple beams of diffraction in LEED (see 
below) experiments. 


SURFACE CRYSTALLOGRAPHY 


The surface structure is in general periodic only in two dimensions. The 
surface structure can be the structure of foreign material deposited on the 
substrate or it can be the selvage of the pure substrate. In Chapter 1 we used 
the term Bravais lattice for the array of equivalent points in two or in three di- 
mensions, that is, for diperiodic or triperiodic structures. In the physics of sur- 
faces it is common to speak of a two-dimensional lattice. Further, the area 
unit may be called a mesh. 

We showed in Fig. 1.7 four of the five nets possible for a diperiodic struc- 
ture; the fifth net is the general oblique net, with no special symmetry relation 
between the mesh basis vectors a 1; a 2 . Thus the five distinct nets are the 
oblique, square, hexagonal, rectangular, and centered rectangular. 

The substrate net parallel to the surface is used as the reference net for 
the description of the surface. For example, if the surface of a cubic substrate 
crystal is the (111) surface, the substrate net is hexagonal (Fig. 1.7b), and the 
surface net is referred to these axes. 

The vectors c 1; c 2 that define the mesh of the surface structure may be ex- 
pressed in terms of the reference net a^ a 2 by a matrix operation P: 





( 1 ) 


Provided that the included angles of the two meshes are equal, the short- 
hand notation due to E. A. Wood may be used. In this notation, which is 
widely used, the relation of the mesh c 1; c 2 to the reference mesh a,, a 2 is ex- 
pressed as 

(§ *§)*“• < 2 » 


in terms of the lengths of the mesh basis vectors and the angle a of relative ro- 
tation R of the two meshes. If a — 0, the angle is omitted. Examples of the 
Wood notation are given in Fig. 2. 

The reciprocal net vectors of the surface mesh may be written as cj, c 2 , 
defined by 

c i ' c 2 = <V c i = 0 ; Cj’cJ = Ca'Ca = 2ir (or 1) . (3) 

Here the 27T (or 1) indicates that two conventions are in use. The definitions 
(3) used in Fig. 3 may be compared with the definitions in Chapter 2 of the 
reciprocal lattice vectors of a triperiodic lattice. 
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p(lxl) p(2 x 2) (V3xV3)K30° 

fcc(lll), hcp(OOOl) 

(a) 


p(l x 1) c(2x2) p(2x2) 

fcc(lOO), bcc(lOO) 

(b) 



p(2 x 1) 



bcc(llO) fcc(l 10) 

(c) (d) 

Figure 2 Surface nets of adsorbed atoms. The circles represent atoms in the top layer of the sub- 
strate. In (a) the designation fce(lll) means the (111) face of an fee structure. Tins face deter- 
mines a reference net. The lines represent ordered overlayers, with adatoms at the intersections of 
two lines. The intersection points represent diperiodic nets (lattices in two dimensions). The des- 
ignation p(l X 1) in (a) is a primitive mesh unit for which the basis is identical with the basis of 
the reference net. In (b) the e(2 X 2) mesh unit is a centered mesh with basis vectors twice as long 
as those of the reference net. Atomic adsorption on metals takes place most often into those sur- 
face sites (hollow sites) that maximize the number of nearest-neighbor atoms on the substrate. 
(After Van Hove.) 


The reciprocal net points of a diperiodic net may be thought of- — when we 
are in three dimensions — as rods. The rods are infinite in extent and normal to 
the surface plane, where they pass through the reciprocal net points. It may be 
helpful to think of the rods as generated by a triperiodic lattice which is ex- 
panded without limit along one of its axes. Then the reciprocal lattice points 
along this axis are moved closer together and in the limit form a rod. 

The usefulness of the rod concept comes out with the Ewald sphere 
construction explained in Fig. 2.8. Diffraction occurs everywhere the Ewald 
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Figure 3 A (3 X 1) surface structure, (a) real-space; and (b) reciprocal-space diagrams. (After 
E. A. Wood.) 



(03) (02) (01) (00) (01) (02) 


Figure 4 Ewald sphere construction for diffraction of incident wave k by a square net, when k is 
parallel to one axis of the mesh. The back scattered beams in the plane of the paper are k(,, ks, ks, 
k^. Diffracted beams out of the plane of the paper will also occur. The vertical lines are the rods of 
the reciprocal net. 


sphere intercepts a reciprocal net rod. Each diffracted beam is labelled with 
the indices hk of the reciprocal net vector 


forming the beam. 


g = hc\ + kc * 2 


( 4 ) 



1 7 Surface and Interface Physics 493 



Figure 5 LEED patterns from a Pt(lll) crystal surface for incident electron energies of 51 and 
63.5 eV. The diffraction angle is greater at the lower energy. (After G. A. Somorjai, Chemistry in 
two dimensions: surfaces, Cornell, 1981.) 



Figure 6 The RHEED method. In (a) the high-energy incident electron beam at a glancing 
angle to the crystal surface is associated with an Ewald sphere of large radius, so large that the sur- 
face is nearly flat in relation to the separation between adjacent rods of the reciprocal net. The 
formation of diffraction lines on a plane screen is shown in (b). (After Prutton.) 


Low energy electron diffraction (LEED) is illustrated by Fig. 4. The 
electron energy is typically in the range 10-1000 eV. With this arrangement 
Davisson and Germer in 1927 discovered the wave nature of the electron. An 
experimental pattern is shown in Fig. 5. 

Reflection High-Energy Electron Diffraction. In the RHEED method a 
beam of high-energy electrons is directed upon a crystal surface at grazing 
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incidence. By adjustment of the angle of incidence one can arrange the normal 
component of the incoming wavevector to be very small, which vail minimize 
the penetration of the electron beam and enhance the role of the crystal 
surface. 

The radius k of the Ewald sphere for 100 keV electrons wall be =10 3 A~ ! , 
which is much longer than the shortest reciprocal lattice vector 27 r/a ~ 1 A -1 . 
It follows that the Ewald sphere will be nearly a flat surface in the central scat- 
tering region. The intercept of the rods of the reciprocal net with the nearly 
flat sphere will be nearly a line when the beam is directed at grazing inci- 
dence. The experimental arrangement is shown in Fig. 6. 

SURFACE ELECTRONIC STRUCTURE 


Work Function 

The work function W of the uniform surface of a metal is defined as the 
difference in potential energy of an electron between the vacuum level and 
the Fermi level. The vacuum level is the energy of an electron at rest at a point 
sufficiently far outside the surface so that the electrostatic image force on the 
electron may be neglected — more than 100 A from the surface. The Fermi 
level is the electrochemical potential of the electrons in the metal. 

Typical values of electron work functions are given in Table 1. The orien- 
tation of the exposed crystal face affects the value of the work function 


Table 1 Electron work functions 3 


(Values obtained by photoemission, except tungsten obtained by field emission.) 


Element Surface plane Work function, in eV 


A g 

(100) 

4.64 


(110) 

4.52 


(111) 

4.74 

Cs 

polycrystal 

2.14 

Cu 

(100) 

4.59 


(110) 

4.48 


(111) 

4.98 

Ge 

(111) 

4.80 

Ni 

(100) 

5.22 


(110) 

5.04 


(111) 

5.35 

W 

(100) 

4.63 


(110) 

5.25 


(111) 

4.47 


“After H. D. Hagstrum. 
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because the strength of the electric double layer at the surface depends on 
the concentration of surface positive ion cores. The double layer exists 
because the surface ions are in an asymmetrical environment, with vacuum 
(or an adsorbed foreign atom layer) on one side and the substrate on the 
other side. 

The work function is equal to the threshold energy for photoelectric emis- 
sion at absolute zero. If fiw is the energy of an incident photon, then the 
Einstein equation is hw — W + T, where T is the kinetic energy of the emitted 
electron and W is the work function. 

Thermionic Emission 

The rate of emission of thermionic electrons depends exponentially on the 
work function. The derivation follows. 

We first find the electron concentration in vacuum in equilibrium 
with electrons in a metal at temperature r(=k B T) and chemical potential /x. 
We treat the electrons in the vacuum as an ideal gas, so that their chemical 
potential is 

M = M'cxi + rlog (n/n Q ) , (5) 

by TP, Chapter 5. Here 

n Q - 2(niT/2irh 2 ) 3/2 , (6) 

for particles of spin 1/2. 

Now /-L cxt - fj, = W, by the definition of the work function W. Thus, 
from (5), 

n — n ^ exp(— W/t) . (7) 

The flux of electrons that leaves the metal surface when all electrons are 
drawn off is equal to the flux incident on the surface from outside: 

J n = \nc~ n(jl2’Trm) m (8) 

by TP(14.95) and (14.121). Here c is the mean speed of the electrons in the 
vacuum. The electric charge flux is ej„ or 

/ e = (r 2 me/2ir 2 ft 3 )exp{ — W/t) . (9) 

This is called the Richardson-Dushman equation for thermionic emission. 

Surface States 

At the free surface of a semiconductor there often exist surface-bound 
electronic states with energies in the forbidden gap between the valence and 
conduction bands of the bulk semiconductor. We can obtain a good impression 
of the nature of the surface states by considering the wave functions in the 
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weak binding or two-component approximation of Chapter 7, in one dimen- 
sion. (The wave functions in three dimensions will have extra factors 
exp| i(k y ij + k-z)\ in the y, z plane of the surface.) 

If the vacuum lies in the region x > 0, the potential energy of an electron 
in this region can be set equal to zero: 

U(x) = 0 , .T > 0 . (10) 

In the crystal the potential energy has the usual periodic form: 

U(x) = 2 Uc exp(iGx) , x < 0 . (11) 

a 

In one dimension G = nir!a, where n is any integer, including zero. 

In the vacuum the wave function of a hound surface state must fall off 
exponentially: 

i/W = ewp(sx) , x>0 . (12) 

By the wave equation the energy of the state referred to the vacuum level is 

e = —h 2 s 2 /2m . (13) 

Within the crystal the two-component wave function of a bound surface 
state will have the form, for x < 0, 

i/i,„ = exp (qx + ikx)[C(k) + C(k — G)exp(— iGx)} , (14) 

by analogy with (7.49), but with the addition of the factor exp(qx) which serves 
to bind the electron to the surface. 

Wc now come to an important consideration that restricts the allowed val- 
ues of the wavevector k. If the state is bound, there can be no current flow in 
the x direction, normal to the surface. This condition is assured in quantum 
mechanics if the wave function can be written as a real function of x, a condi- 
tion already satisfied by the exterior wave function (12). But (14) can be a real 
function only if k — §G, so that 

i/r in = exp(qx)[C(|G)exp(iGx/2) + C( — fC)exp(— iGx/2)] . (15) 

This is real provided C*(|G) = C(— |G). Thus k x for a surface state does not 
have a continuum of values, but is limited to discrete states associated with 
Brillouin zone boundaries. 

The state (15) is damped exponentially in the crystal. The constants s, q 
are related by the condition that 4> and dip/dx are continuous at x = 0. The 
binding energy e is determined by solving the two-component secular equa- 
tion analogous to (7.46). The plot of Fig. 7.12 is helpful in this connection. 
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Tangential Surface Transport 

We have seen that there may exist surface-hound electronic states with en- 
ergies in the forbidden gap between the valence and conduction bands of the 
substrate crystal. These states may be occupied or vacant; their existence must 
affect the statistical mechanics of the problem. This means that the states mod- 
ify the local equilibrium concentration of electrons and holes, as expressed as a 
shift of the chemical potential relative to the band edges. Because the chemical 
potential is independent of position in an equilibrium system, the energy bands 
must be displaced or bent, as in Fig. 7. 

The thickness and carrier concentration in the surface layer may be 
changed by applying an electric field normal to the surface. The effect of an 
external field is utilized in the metal-oxide-semiconductor field-effect transis- 
tor (MOSFET). This has a metal electrode just outside the semiconductor 
surface and insulated from it by a layer of oxide. A voltage, the gate voltage V g , 
is applied between the metal and semiconductor that modulates the n s , the 
surface charge density per unit area: 

A n s = CgAVg , 

where C„ is the capacitance per unit area between the metallic gate and the 
semiconductor. This surface charge layer forms the conducting pathway of the 
MOSFET. The conductance of a surface layer of length L and width W 
between two electrical contacts is: 

G = (W/L)n s ep, , 

where p is the carrier mobility. The carrier density n s , and hence the conduc- 
tance, is controlled by the gate voltage. This three-terminal electronic valve is 


Vacuum Vacuum 

potential potential 




Figure 7 Band bending near a semiconductor surface that can give a highly conducting surface 
region, (a) Inversion layer on an n-type semiconductor. For the bending as shown, the hole con- 
centration at the surface is far larger than the electron concentration in the interior, (b) Accumula- 
tion layer on an ri-typc semiconductor, with an electron concentration at the surface that is far 
higher than in the interior. 
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a principal component in microelectronic systems. The electronic states occu- 
pied by the carriers at the surface are quantized along the direction normal to 
the interface, as treated in Problem 2. 


MAGNETORESISTANCE IN A TWO-DIMENSIONAL CHANNEL 


The static magnetoconductivity tensor in 3D was found in Problem 6.9. 
Here we translate that result to a 2D surface conductance channel in the xy 
plane, with the static magnetic field in the z direction, normal to the MOS 
layer. We assume the surface density of electrons is n s = N/L 2 . The surface 
conductance is defined as the volume conductivity times the layer thickness. 
The surface current density is defined as the current crossing a line of unit 
length in the surface. 

Thus, with (6.43) and (6.65) the surface tensor conductance components 
become 


<r„ = 


1 + (w c t) 2 


'xy 


1 4- (ft ) r jf 


(16) 


where cr 0 = n s e 2 r/m and oi, = eBlmc in CGS and eB/rn in SI. The following dis- 
cussion is written in CGS only, except where ohms are used. 

These results apply specifically in the relaxation time approximation used 
in Chapter 6. When (o c t > 1, as for strong magnetic field and low tempera- 
tures, the surface conductivity components approach the limits 

rr :ix = 0 ; tr = n s ec/B . (17) 

The limit for u,y is a general property' of free electrons in crossed electric 
E and magnetic fields B,. We establish the result that such electrons drift in 
the x direction with velocity v D — cE^/B,. Consider the electrons from a 
Lorentz frame that moves in the x direction with this velocity. By electromag- 
netic theory there is in this frame an electric field E’ y = —v D B./c that will can- 
cel the applied field E y for the above choice of v D . Viewed in the laboratory' 
frame, all electrons drift in the x direction with velocity v D in addition to any 
velocity components they had before E y was applied. 

Thus/, — cr„jE {j = n s ev n — { n s ec/B)E y , so that 

<r xl/ — n/.c/B (18) 

as in (17). The experiments measure the voltage V in the y direction and the 
current I in the x direction (Fig. 8). Here I x — j x L y — in,ec/B){E y L y ) = 
(n s ec/B)V y . The Hall resistance is 

p H — V y /l x — B/n s ec . (18a) 

We see that/, can flow with zero E x , so that the effective conductance jJF, x 
can be infinite. Paradoxically, this limit occurs only when cr xx and <J tn are zero. 
Consider the tensor relations 
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Figure 8 Applied field £ l; and drift current I x 
in a quantum Hall effect (IQHE) experiment. 

j x = cr xx E x + a^jEy ■ jj = <j ijx E x + (T Jy E y . (19) 

In the Hall effect geometry j y = 0, so that E lf = (a XIJ /a !r j)E x , with cr = —a <Jx . 
Thus 

jx = (Vxx + of/cr yy )E I = cr(eff)E r , (20) 

and in the limit cr xx = i T ytJ = 0 the effective conductance is infinite. 

Integral Quantized Hall Effect (IQHE) 

The results of the original measurements 1 under quantum conditions of 
temperature and magnetic field are shown in Fig. 9. The results are remarkable: 
at certain values of the gate voltage the voltage drop in the direction of current 
flow goes essentially to zero, as if the effective conductance were infinite. 
Further, there are plateaus of the Hall voltage near these same values of gate 
voltage, and the values of the Hall resistivity V H /l x at these plateaus are accu- 
rately equal to (25,813/integer) ohms, where 25,813 is the value of hie 2 
expressed in ohms. 

The IQHE voltage minima V pp may be explained on a model that is, how- 
ever, oversimplified. Later we give a general theory. Apply a strong magnetic field 
such that the separation h(o r > k B T. It is meaningful to speak of Landau levels 
that are completely filled or completely empty. Let the electron surface concen- 
tration (proportional to the gate voltage) be adjusted to any of the set of values 
that cause the Fermi level to fall at a Landau level: from (9.33) and (9.34), 

seBJhc = n s , (21) 

where s is any integer and n s is the electron surface concentration. 

When the above conditions are satisfied, the electron collision time is 
greatly enhanced. No elastic collisions are possible from one state to another 
state in the same Landau level because all possible final states of equal energy 
are occupied. The Pauli principle prohibits an elastic collision. Inelastic colli- 
sions to a vacant Landau level are possible with the absorption of the necessary 
energy from a phonon, but there are very few thermal phonons of energy 
greater than the interlevel spacing by virtue of the assumption hw c > k B T. 



’K. von Klitzing, G. Dorda, and M. Pepper, Phys. Rev. Lett. 45, 494 (1980). 
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(b) 


Figure 9 In the original IQHE measurements a magnetic field of 180 kG (18 T) points out of the 
paper. The temperature is 1.5 K. A constant current of 1 p A is made to flow between the source 
and the drain. Voltages V pp and V H are plotted versus the gate voltage V g , which is proportional to 
the Fermi level. (After K. von Klitzing, G. Dorda, and M. Pepper.) 

The quantization of the Hall resistance follows on combining (18a) and (21): 

p H = h/se 2 = 27 r/sca , (22) 

where a is the fine structure constant e~/hc s 1/137, and s is an integer. 

IQHE in Real Systems 

The measurements (Fig. 9) suggest that the above theory of the IQHE 
is too good. The Hall resistivity is accurately quantized at 25,813/s ohms, 
whether or not the semiconductor is of very high purity and perfection. The 
sharp Landau levels (Fig. 10a) are broadened in the real crystal (Fig. 10b), but 
this does not affect the Hall resistivity. The occurrence of plateaus in the Hall 
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(a) (b) 

Figure 10 Density of states in a 2D electron gas in a strong magnetic field, (a) Ideal 2D crystal, 
(b) Real 2D crystal, with impurities and other imperfections. 


B 


Figure 11 Geometry for Laughlin’s 
thought-experiment. The 2D electron sys- 
tem is wrapped around to form a cylinder. 
A strong magnetic field B pierces the 
cylinder everywhere normal to its surface. 
A current I circles the loop, giving rise to 
the Hall voltage V H and a small magnetic 
flux ip through the loop. 

1, is not expected in ideal systems 
because partially filled Landau levels will exist for all gate voltages except 
those for which the Fermi level exactly coincides with a Landau level. Yet the 
experiments show that a range of V g values gives the exact Hall resistance. 

Laughlin 2 interpreted the results for real systems as the expression of the 
general principle of gauge invariance. The argument is subtle and somewhat 
reminiscent of the flux quantization in a superconductor in Chapter 10. 

In Laughlin s thought-experiment the 2D electron system is bent to form a 
cylinder (Fig. 11) whose surface is pierced everywhere by a strong magnetic 
field B normal to the surface. The current I (former I x ) circles the loop. The 
magnetic field B acts on the charge carriers to produce a Hall voltage V H (for- 
mer Vy) perpendicular to the current and to B; that is, V H is developed be- 
tween one edge of the cylinder and the other. 

The circulating current I is accompanied by a small magnetic flux <p that 
threads the current loop. The aim of the thought-experiment is to find the 



resistance, evident in the U H curve of Fig. 9 


2 R. B. Laughlin, Phys. Rev. B 23, 5632 (1981); see also his article in the McGraw-Hill 
year-book of science and technology, 1984, pp. 209-214. A review is given by H. L. Stormer and 
D. C. Tsui, Science 220, 1241 (1983). 


502 


relation between I and V H . We start with the electromagnetic relation that re- 
lates I to the total energy U of a resistanceless system: 


dU 

dt 


= -vj* 


I 

c dt 


1 = 



(23) 


The value of 1 can now be found from the variation 8U of the electronic energy 
that accompanies a small variation 8<p of the flux. 

The carrier states divide into two classes: 


• Localized states, which are not continuous around the loop. 

• Extended states, continuous around the loop. 

Localized and extended states cannot coexist at the same energy’, according to 
our present understanding of localization. 

The two classes of states respond differently to the application of the flux 
ip. The localized states are unaffected to first order because they do not en- 
close any significant part of <p. To a localized state a change in (p looks like a 
gauge transformation, which cannot affect the energy of the state. 

The extended states enclose cp. and their energy may be changed. How- 
ever, if the magnetic flux is varied by a flux quantum, 8<p = hc/e, all extended 
orbits are identical to those before the flux quantum was added. The argument 
here is identical to that for the flux quantization in the superconducting ring 
treated in Chapter 10, but with the 2e of the Cooper pair replaced by e. 

If the Fermi level falls within the localized states of Fig. 10b, all extended 
states (Landau levels) below the Fermi level will be filled with electrons both 
before and after the flux change Sip. However, during the change an integral 
number of states, generally one per Landau level, enter the cylinder at one 
edge and leave it at the opposite edge. 

The number must be integral because the system is physically identical 
before and after the flux change. If the transferred state is transferred while 
occupied by one electron, it contributes an energy change eV H ; if N occupied 
states are transferred, the energy change is NeV H . 

This electron transfer is the only way the degenerate 2D electron system 
can change its energy. We can understand the effect by looking at a model sys- 
tem without disorder in the Landau gauge for the vector potential: 

A = -Byx . (24) 

An increase SA that corresponds to the flux increase 8<p is equivalent to a dis- 
placement of an extended state by 8A/B in the y direction. By the Stokes theo- 
rem and the definition of the vector potential we have 8(p — L,8A. Thus 8q> 
causes a motion of the entire electron gas in the y direction. 

By 8 U = NeV H and 8 ip = hc/e, we have 


I = c(AU/A(p) = cNe 2 V H /hc = (Ne 2 /h)V H , 


(25) 
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so that the Hall resistance is 

Ph — V u /I = h/Ne 2 , 


as in (22). 


(26) 


Fractional Quantized Hall Effect (FQHE). A quantized Hall effect has 
been reported for similar systems at fractional values of the index s, by 
working at lower temperatures and higher magnetic fields. In the extreme 
quantum limit the lowest Landau level is only partially occupied, and the inte- 
gral QBE treated above should not occur. It has been observed, 3 however, that 
the Hall resistance p H is quantized in units ot 3h/e 2 when the occupation of the 
lowest Landau level is 1/3 and 2/3, and p xx vanishes for these occupations. Sim- 
ilar breaks have been reported for occupations of 2/5, 3/5, 4/5, 
and 2/7. 


p-n JUNCTIONS 

A p-n junction is made from a single crystal modified in two separate re- 
gions. Acceptor impurity atoms are incorporated into one part to produce the 
p region in which the majority carriers are holes. Donor impurity atoms in the 
other part produce the n region in which the majority carriers are electrons. 
The interface region may be less than 10 4 cm thick. Away from the junction 
region on the p side there are (— ) ionized acceptor impurity atoms and an 
equal concentration of free holes. On the n side there are ( + ) ionized donor 
atoms and an equal concentration of free electrons. Thus the majority carriers 
are holes on the p side and electrons on the n side, Fig. 12. 

Holes concentrated on the p side would like to diffuse to fill the crystal 
uniformly. Electrons would like to diffuse from the n side. But diffusion will 
upset the local electrical neutrality of the system. 

A small charge transfer by diffusion leaves behind on the p side an excess 
of ( — ) ionized acceptors and on the n side an excess of ( + ) ionized donors. 
This charge double layer creates an electric field directed from n to p that in- 
hibits diffusion and thereby maintains the separation of the two carrier types. 
Because of this double layer the electrostatic potential in the crystal takes a 
jump in passing through the region of the junction. 

In thermal equilibrium the chemical potential of each carrier type is 
everywhere constant in the crystal, even across the junction. For holes 

k B T In p(r) -I- e(p{ r) = constant (27a) 


5 D. C. Tsui, H. L. Stormer, and A. C. Gossard, Phys. Kev. Lett. 48, 1562 ( 1982 ); A. M. Chang 
et at, Phys. Rev. Lett. 53, 997 (1984). For a discussion of the theory see R. Laughlin in G. Bauer 
et al., eds.. Two-dimensional systems, heterostructures , and superlattices. Springer, 1984 . 
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Figure 12 (a) Variation of the hole and elec- 
tron concentrations across an unbiased (zero 
applied voltage) junction. The carriers are in 
thermal equilibrium with the acceptor and 
donor impurity atoms, so that the product pn of 
the hole and electron concentrations is constant 
throughout the crystal in conformity with the 
law of mass action, (b) Electrostatic potential 
from acceptor ( — ) and donor ( + ) ions near the 
junction. The potential gradient inhibits diffu- 
sion of holes from the p side to the n side, and it 
inhibits diffusion of electrons from the n side to 
the p side. The electric field in the junction re- 
gion is called the built-in electric field. 


Concentration 



(a) 



(b) 


across the crystal, where p is the hole concentration and (p the electrostatic 
potential. Thus p is low where <p is high. For electrons 

k B T In n(r) — ecp(r ) — constant , (27b) 

and n will be low where cp is low. 

The total chemical potential is constant across the crystal. The effect of 
the concentration gradient exactly cancels the electrostatic potential, and the 
net particle flow of each carrier type is zero. However, even in thermal equi- 
librium there is a small flow of electrons from n to p where the electrons end 
their lives by combination with holes. The recombination current J nr is bal- 
anced by a current J ng of electrons which are generated thermally in the p re- 
gion and which are pushed by the built-in field to the n region. Thus in zero 
external applied electric field 

Jnr( 0)+J ng (0) = 0, (28) 

for otherwise electrons would accumulate indefinitely on one side of the barrier. 

Rectification 

A p-n junction can act as a rectifier. A large current will flow if we apply a 
voltage across the junction in one direction, but if the voltage is in the opposite 
direction only a very small current will flow. If an alternating voltage is applied 
across the junction the current will flow chiefly in one direction — the junction 
has rectified the current (Fig. 13). 
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0.01 0.1 1.0 10 100 Figure 13 Rectification charade ris- 

Current in milliamps/cm 2 tic of a p-n junction in germanium. 

after Shockley. 

For back voltage bias a negative voltage is applied to the p region and a 
positive voltage to the n region, thereby increasing the potential difference be- 
tween the two regions. Now practically no electrons can climb the potential 
energy hill from the low side of the barrier to the high side. The recombina- 
tion current is reduced by the Boltzmann factor: 

J JV back) = 0) exp ( -e\ V| !k B T) . (29) 

The Boltzmann factor controls the number of electrons with enough energy to 
get over the barrier. 

The thermal generation current of electrons is not particularly affected by 
the back voltage because the generation electrons flow downhill (from p to n) 
anyway: 

/ng(Vback) =/ ng (0) . (30) 

We saw in (28) that J nr ( 0) = -J n J0); thus the generation current dominates 
the recombination current for a back bias. 

When a forward voltage is applied, the recombination current increases 
because the potential energy barrier is lowered, thereby enabling more elec- 
trons to flow from the n side to the p side: 

J nr (y forward) = J nr ( 0) exp {e\ V| / k B T ) . (31) 

Again the generation current is unchanged: 

J n g (V forward) =/„ g (0) . 
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(32) 
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The hole current flowing across the junction behaves similarly to the elec- 
tron current. The applied voltage which lowers the height of the barrier for 
electrons also lowers it for holes, so that large numbers of electrons flow from 
the n region under the same voltage conditions that produce large hole cur- 
rents in the opposite direction. 

The electric currents of holes and electrons are additive, so that the total 
forward electric current is 

I = I,[exp(eV/k B T)-l] , (33) 

where I s is the sum of the two generation currents. This equation is well satis- 
fied for p-n junctions in germanium (Fig. 13), but not quite as well in other 
semiconductors. 

Solar Cells and Photovoltaic Detectors 

Let us shine light on a p-n junction, one without an external bias voltage. 
Each absorbed photon creates an electron and a hole. When these carriers dif- 
fuse to the junction, the built-in electric field of the junction separates them at 
the energy barrier. The separation of the carriers produces a forward voltage 
across the barrier: forward, because the electric field of the photoexcited carri- 
ers is opposite to the built-in field of the junction. 

The appearance of a forward voltage across an illuminated junction is 
called the photovoltaic effect. An illuminated junction can deliver power to 
an external circuit. Large area p-n junctions of silicon are used as solar panels 
to convert solar photons to electrical energy. 

Schottky Barrier 

When a semiconductor is brought into contact with a metal, there is formed 
in the semiconductor a barrier layer from which charge carriers are severely de- 
pleted. The barrier layer is also called a depletion layer or exhaustion layer. 

In Fig. 14 an n-type semiconductor is brought into contact with a metal. 
The Fermi levels are coincident after the transfer of electrons to the conduc- 
tion band of the metal. Positively charged donor ions are left behind in this re- 
gion that is practically stripped of electrons. Here the Poisson equation is 

(CGS) divD = 47me (SI) divD=ne/e 0 , (34) 

where n is the donor concentration. The electrostatic potential is determined by 

(CGS) d 2 cp/dx 2 = — 4t me/e (SI) d 2 (p/dx 2 = ne/ee 0 , (35) 

which has a solution of the form 

(SI) <P~— (ne/2€€ 0 )x 2 . 


(CGS) <p = —(2rme/e)x 2 


(36) 
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Vacuum Bottom of 
level conduction 



(a) Before contact (b) Just after (c) Equilibrium 

contact established 

Figure 14 Rectifying barrier between a metal and an n-type semiconductor. The Fermi level is 
shown as a broken line. 


The origin of x has been taken for convenience at the right-hand edge of 
the barrier. The contact is at — x b , and here the potential energy relative to the 
right-hand side is — ecp 0 , whence the thickness of the barrier is 

(CGS) = (e|(po|/2ime) 1/2 (SI) x b = (2ee 0 \<p 0 \/ne) 1/z . (37) 

With e = 16; e(p 0 = 0.5 eV; n = 10 16 cm' 3 , we find x b = 0.3 /xm. This is a some- 
what simplified view of the metal-semiconductor contact. 

HETEROSTRUCTURES 

Semiconductor heterostructures are layers of two or more different semi- 
conductors grown coherently with one common crystal structure. Heterostruc- 
tures offer extra degrees of freedom in the design of semiconductor junction 
devices, because both the impurity doping and the conduction and valence 
band offsets at the junction can be controlled. Because of this freedom many 
devices that utilize compound semiconductors incorporate heterostructures. 
Examples include semiconductor lasers in CD players and high-speed devices 
for cell-phone systems. 

A heterostructure may be viewed as a single crystal in which the occu- 
pancy of the atomic sites changes at the interface. As an example, one side of 
the interface can be Ge and the other side GaAs: both lattice constants are 
5.65 A. One side has the diamond structure and the other side the cubic zinc 
sulfide structure. Both structures are built up from tetrahedral covalent bonds 
and fit together coherently as if they were a single crystal. There are a few 
edge dislocations (Chapter 21) to relieve the strain energy near the interface. 

The band gaps, however, are different, and this difference is the source of 
the real interest in the heterostructure, apart from the technical virtuosity in 
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Normal Staggered Broken gap 



Figure 15 Three types of band edge offsets at hetero-interfaces. The forbidden gaps are shown 
shaded. The offset called normal occurs, for example, in GaAs/(Al,Ga)As. The “broken-gap” offset 
occurs in the GaSb/InAs heterojunction. 


forming the structure. The band gaps are 0.67 eV for Ge and 1.43 eV for GaAs, 
at 300 K. The relative alignment of the conduction and valence band edges of- 
fers several possibilities, as shown in Fig. 15. Calculations suggest that the top 
of the valence band E v in Ge should lie about 0.42 eV higher than in GaAs. The 
bottom of the conduction band E c in Ge should lie about 0.35 eV lower than in 
GaAs, so that the offsets are classified as normal in the scheme of Fig. 15. 

Band edge offsets act as potential barriers in opposite senses on electrons 
and holes. Recall that electrons lower their energy by “sinking” on an energy 
band diagram, whereas holes lower their energy by “floating” on the same dia- 
gram. For the normal alignment both electrons and holes are pushed by the 
barrier from the wide-gap to the narrow-gap side of the heterostructure. 

Other important semiconductor pairs used in heterostructure are 
AlAs/GaAs, InAs/GaSb, GaP/Si, and ZnSe/GaAs. Good lattice matching in the 
range 0. 1-1.0 percent is often accomplished by use of alloys of different ele- 
ments, which may also adjust energy gaps to meet specific device needs. 

n-N Heterojunction 

As a practical example, consider two n-type semiconductors with a large 
offset of the two conduction bands, as sketched in Fig. 16a for a semiconduc- 
tor pair with a normal band line-up. The n-type material with the higher con- 
duction band edge is labeled with a capital letter as N-type, and the junction 
shown is called an n-N junction. The electron transport properties across the 
junction are similar to those across a Schottky barrier. Far from the interface 
the two semiconductors must be electrically neutral in composition. However, 
the two Fermi levels, each determined by the doping, must coincide if there is 
to be zero net electron transport in the absence of an external bias voltage. 

These two considerations fix the “far-off” conduction band edge energies 
relative to the Fermi level, as in Fig. 16b. The combination of a specified band 
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Figure 16 (a) Two semiconductors not in contact; the absolute band edge energies are labeled E c 
for the conduction band edge and E t for the valence band edge. An “absolute energy” means re- 
ferred to infinite distance. The Fermi levels in the two materials are determined by the donor con- 
centrations, as well as by the band structure, (b) The same semiconductors as a heterojunction, so 
that the two parts are in diffusive equilibrium. This requires that the Fermi level (F.L.) be inde- 
pendent of position, which is accomplished by transfer of electrons from the N-side to the n-side 
of the interface. A depletion layer of positively ionized donors is left behind on the N-side. 


offset (determined by the host material composition) at the interface and the 
distant band energies (determined by the Fermi level) can be reconciled only 
if the bands bend near the interface, as in the figure. The necessary band 
bending is created by space charges consequent to the transfer of electrons 
from the N-side to the lower n-side. This transfer leaves behind on the N-side 
a positive donor space charge layer, which through the Poisson equation of 
electrostatics is the source of the positive second derivative (upward curva- 
ture) in the conduction band edge energy on that side. 

On the n-side there is now a negative space charge because of the excess 
of electrons on that side. The layer of negative space charge gives a negative 
second derivative (downward curvature) in the conduction band edge energy. 
On the n-side the band as a whole bends down toward the junction. This dif- 
fers from the usual p-n junction. The downward bending and the potential 
step form a potential well for electrons. The well is the basis for the new physi- 
cal phenomena characteristic of heterostructure physics. 

If the doping on the n-side (low E c ) is reduced to a negligible value, there 
will be very few ionized donors on that side in the electron-rich layer. The mo- 
bility of these electrons is largely limited only by lattice scattering, which falls 
off sharply as the temperature is lowered. Low-temperature mobilities as high 
as 10 7 cm z V _1 s _1 have been observed in GaAs/(Al, Ga)As. 

If now the thickness of the N-side semiconductor is reduced below the de- 
pletion layer thickness on that side, the N material will be entirely depleted of 
its low-mobility electrons. All of the electrical conduction parallel to the inter- 
face will be carried by the high-mobility electrons on the n-side, equal in num- 
ber to the number of ionized N-side donors, but spatially separated from them 
by the potential step. Such high-mobility structures play a large role in solid 
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state studies of 2D electron gases and also in new classes of high-speed field 
effect transistors for computer applications at low temperatures. 

SEMICONDUCTOR LASERS 

Stimulated emission of radiation can occur in direct-gap semiconductors 
from the radiation emitted when electrons recombine with holes. The electron 
and hole concentrations created by illumination are larger than their equilib- 
rium concentrations. The recombination times for the excess carriers are 
much longer than the times for the conduction electrons to reach thermal 
equilibrium with each other in the conduction band, and for the holes to reach 
thermal equilibrium with each other in the valence band. This steady-state 
condition for the electron and hole populations is described by separate Fermi 
levels fL c and p,. for the two bands, called quasi-Fermi levels. 

With /jl c and p ., referred to their band edges, the condition for population 
inversion is that 

M*>/b + e g • (38) 

For laser action the quasi-Fermi levels must be separated by more than the 
band gap. 

Population inversion and laser action can be achieved by forward bias of 
an ordinary’ GaAs or InP junction, but almost all practical injection lasers em- 
ploy the double heterostructure proposed by H. Kroemer (Fig. 17). Here the 
lasing semiconductor is embedded between two wider-gap semiconductor re- 
gions of opposite doping, creating a quantum well that confines both electrons 
and holes. An example is GaAs embedded in (Al,Ga)As. In such a structure 
there is a potential barrier that prevents the outflow of electrons to the p-type 
region, and an opposite potential barrier that prevents the outflow of holes to 
the n-type region. 

The value of p. c in the optically active layer lines up with p ;l in the n con- 
tact; similarly, p. t lines up with p, p in the p contact. Inversion can be achieved if 
\ve apply a bias voltage larger than the voltage equivalent of the active layer 
energy gap. The diode wafer provides its own electromagnetic cavity; for the 
reflectivity at the crystal-air interface is high. Crystals are usually polished to 
provide two flat parallel surfaces; the radiation is emitted in the plane of the 
heterojunctions. 

Crystals with direct band gaps are required normally for junction lasers. 
Indirect gaps involve phonons as well as photons; carriers recombine less effi- 
ciently because of competing processes, and no laser action has been observed 
in indirect gap semiconductors. 

Gallium arsenide has been widely studied as the optically active layer. It 
emits in the near infrared at 8383 A or 1.48 eV; the exact wavelength depends 
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Figure 17 Double heterostructure injection laser. Electrons flow from the right into the optically- 
active layer, where they form a degenerate electron gas. The potential barrier provided by the 
wide energy gap on the p side prevents the electrons from escaping to the left. Holes flow from 
the left into the active layer, but cannot escape to the right. 


on temperature. The gap is direct (Chapter 8). In a heterojunction the system 
is very efficient: the ratio of light energy output to dc electrical energy input is 
near 50 percent, and the differential efficiency for small changes is up to 90 
percent. 

The wavelength can be adjusted over a wide range in the alloy system 
Ga,In 1 _ r P !/ As 1 _ !/ , so that we can match the laser wavelength to the absorption 
minimum of optical fibers used as a transmission medium. The combination of 
double heterostructure lasers with glass fibers forms the basis of the new light- 
wave communication technology that is rapidly replacing transmission of sig- 
nals over copper lines. 


LIGHT-EMITTING DIODES 

The efficiency of light-emitting diodes is now at the point of exceeding in- 
candescent lamps. Consider a p-n junction with a voltage source V splitting 
the two chemical potentials pu„ and /i p by eV, as in Figure 18. Electrons from 
the n side are injected into the p side, and holes from the p side are injected 
into the n side. These injected carriers annihilate each other across the junc- 
tion, thus generating photons if the quantum efficiency is unity. 
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The generation or recombination process will be much stronger in a 
direct-gap semiconductor (Fig. 8.5a) than in an indirect gap semiconductor 
(Fig. 8.5b). In a direct-gap semiconductor such as GaAs, the band-to-band 
photons are absorbed in a distance =1 /im, which is strong absorption. The 
direct-gap ternary semiconductor GaAsj-* P x gives light tuned to shorter 
wavelengths as the composition variable x is increased. This composition was 
made by Holonyak into one of the first p-n diode lasers and into the first visi- 
ble-spectrum (red) LED. Blue-emitting heterostructures have now been 
made, such as In I Ga 1 _ I N — Al^Ga^N. 

The performance of LEDs has increased markedly over the years, from 
about 0.1 lumens/watt in 1962 to about 40 lumens/watt in 2004, compared 
with 15 lumens/watt for a standard white unfiltered incandescent lamp. To 
quote Craford and Holonyak, “We are entering an entirely new era in lighting 
(illumination) with an ultimate form of lamp — a direct-gap III-V alloy p-n 
heterostructure.” 
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Figure 19 The diffraction pattern from a single line of lattice constant a in a monochromatic 
x-ray beam perpendicular to the line, (a) The condition for constructive interference is a cos 0 = n\, 
where n is an integer, (b) For given n the diffracted rays of constant A lie on the surface of a cone. 


Problems 

1. Diffraction from a linear array and a square array. The diffraction pattern of a 
linear structure of lattice constant a is explained’ in Fig. 19. Somewhat similar 
structures are important in molecular biology: DNA and many proteins are linear 
helices, (a) A cylindrical film is exposed to the diffraction pattern of Fig. 19b; the 
axis of the cylinder is coincident with the axis of the linear structure or fiber. De- 
scribe the appearance of the diffraction pattern on the film, (b) A flat photographic 
plate is placed behind the fiber and normal to the incident beam. Sketch roughly 
the appearance of the diffraction pattern on the plate, (c) A single plane of atoms 
forms a square lattice of lattice constant a. The plane is normal to the incident x-ray 
beam. Sketch roughly the appearance of the diffraction pattern on the photographic 
plate. Hint: The diffraction from a plane of atoms can be inferred from the patterns 
for two perpendicular lines of atoms, (d) Figure 20 shows the electron diffraction 
pattern in the backward direction from the nickel atoms on the (110) surface of a 
nickel crystal. Explain the orientation of the diffraction pattern in relation to the 
atomic positions of the surface atoms shown in the model. Assume that only the sur- 
face atoms are effective in the reflection of low-energy electrons. 

’Another viewpoint is useful: for a linear lattice the diffraction pattern is described by the 
single Laue equation a • Ak = 2irq, where q is an integer. The lattice sums which led to the other 
Laue equations do not occur for a linear lattice. Now a • Ak = constant is the equation of a plane; 
thus the reciprocal lattice becomes a set of parallel planes normal to the line of atoms. 



(a) 


<b> 

Figure 20 (a) Backward scattering pattern of 76 eV electrons incident normally on the (110) face 
of a nickel crystal; a model of the surface is shown in (b). (Courtesy of A. U. MacRae.) 


2. Surface subbands in electric quantum limit. Consider the contact plane be- 
tween an insulator and a semiconductor, as in a metal-oxide-semiconductor transis- 
tor or MOSFET. With a strong electric field applied across the Si0 2 -Si interface, 
the potential energy of a conduction electron may be approximated by V(x) = eEx 
for x positive and by V{x) = °° for x negative, where the origin of x is at the inter- 
face. The wavefunction is 0 for x negative and may be separated as < p(x,y,z) = u(x ) 
exp [i(k tJ y 4- k.z)], where u(x ) satisfies the differential equation 

— (h 2 /2m)d 2 u/dx 2 + V(x)u = eu . 

With the model potential for V(x) the exact eigenfunctions are Air)' functions, but 
we can find a fairly good ground state energy from the variational trial function x 
exp(— ax), (a) Show that (e) = (h z /2m)a 2 + 3eE/2a. (b) Show that the energy is a 
minimum when a = (3eEm/2h 2 ) m . (c) Show that (e) mln = 1.89(ft 2 /2m) 1/3 ( 3eE/2)' m . 
In the exact solution for the ground state energy the factor 1.89 is 
replaced by 1.78. As E is increased the extent of the wavefunction in the x direction 
is decreased. The function u(x) defines a surface conduction channel on the semi- 
conductor side of the interface. The various eigenvalues of u(x) define what are 
called electric subbands. Because the eigenfunctions are real functions of x the 
states do not carry' current in the x direction, but they do carry a surface channel 
current in the yz plane. The dependence of the channel on the electric field E in the 
x direction makes the device a field effect transistor. 

3. Properties of the two-dimensional electron gas. Consider a two-dimensional elec- 
tron gas (2DEG) with twofold spin degeneracy but no valley degeneracy, (a) Show 
that the number of orbitals per unit energy is given by: D(e) = m/rrh 2 . (b) Show that 
the sheet density is related to the Fermi wavevector by: n s = k F / 2 tt. (c) Show that, in 
the Drude model, the sheet resistance, i.e., the resistance of a square segment of the 
2DEG, can be written as: R s = (h/e 2 )/(k F t) where € = is the mean free path. 
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Figure 1 Schematic and scanning electron microscope (SEM) image of a gate electrode pattern 
on a GaAs/AlGaAs heterostructure used to create a quantum dot of complex shape in the underly- 
ing two-dimensional (2D) electron gas. (Courtesy of C. Marcus.) 
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CHAPTER 18: NANOSTRUCTURES 


The previous chapter addressed solids with spatial confinement at the 
nanometer scale along one direction: surfaces, interfaces, and quantum wells. 
These systems were effectively two-dimensional, which we define as extended 
in two directions but of nanometer scale in the third. Only a small number of 
quantized states — often only one — are occupied in the confined direction. In 
this chapter we discuss solids confined in either two or three orthogonal direc- 
tions, creating effectively one-dimensional (ID) or zero-dimensional (OD) 
nanostructures. Important ID examples are carbon nanotubes, quantum 
wires, and conducting polymers. Examples of OD systems include semiconduc- 
tor nanocrystals, metal nanoparticles, and lithographically patterned quantum 
dots. Some examples are shown in Figs. 1 to 3. We will almost exclusively focus 
on nanostructures that are created from confined periodic solids. Nonperiodic 
nanostructures are of great interest in other fields, such as molecular assem- 
blies in chemistry and organic macromolecules in biology. 

The techniques for the creation of nanostructures can be divided into two 
broad categories. Top-down approaches use lithographic patterning to struc- 
ture macroscopic materials at the nanoscale, such as the metallic electrodes 
on top of a semiconductor heterostructure shown in Fig. 1. Bottom-up ap- 
proaches utilize growth and self-assembly to build nanostructures from atomic 
or molecular precursors. A CdSe nanocrystal grown in solution is shown in 
Fig. 2. It is typically difficult to create structures smaller than 50 nm with 



Figure 2 Model and transmission electron microscope (TEM) image of a CdSe nanocrystal. 
Individual rows of atoms are clearly resolved in the TEM image. (Courtesy of A. P. Alivisatos.) 
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Figure 3 Atomic force microscope (AFM) image of a pair of crossed carbon nanotubes contacted 
by Au electrodes patterned by electron beam lithography. (Image courtesy of M. S. Fuhrer.) Also 
shown is a model of the nanotube cross region, showing the honeycomb lattice of the graphene 
sheets that form the nanotube walls. (Courtesy of P. Avouris.) 


top-down techniques, while it is often difficult to create structures larger than 
50 nm by bottom-up techniques. A major challenge of nanoscience and tech- 
nology is to combine these approaches and develop strategies to reliably create 
complex systems over all length scales, from the molecular to the macroscopic. 
Figure 3 shows one example, where 100-nm-wide lithographic electrodes make 
contact to 2-nm-wide carbon nanotubes grown by chemical vapor deposition. 

When the extent of a solid is reduced in one or more dimensions, the 
physical, magnetic, electrical, and optical properties can be dramatically al- 
tered. This makes nanostructures a subject of both fundamental and practical 
interest; their properties can be tailored by controlling their size and shape on 
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the nanometer scale. One class of effects is related to the large ratio of number 
of surface atoms to bulk atoms in a nanostructure. For a spherical nanoparticle 
of radius R composed of atoms with an average spacing a, the ratio is given by 

N sarf /N = 3a/R . (1) 

For R — 6a~ 1 nm, half of the atoms are on the surface. The large surface 
area of nanoparticles is advantageous for applications in gas storage, where 
molecules are adsorbed on the surfaces, or in catalysis, where reactions occur 
on the surface of the catalyst. It also has dramatic effects on the stability of the 
nanoparticle. The cohesive energy is dramatically lowered because atoms on 
the surface are incompletely bonded. Nanoparticles therefore melt at temper- 
atures far below the melting temperature ol the corresponding bulk solid. 

The fundamental electronic and vibrational excitations of a nanostructure 
also become quantized, and these excitations determine many of the most impor- 
tant properties of the nanostmetured material. These quantization phenomena 
■will be the primary subject of this chapter. Typically, they are important in the 
1-100-nanometer size range. 

IMAGING TECHNIQUES FOR NANOSTRUCTURES 

The development of new techniques to image and probe nanostructures 
has been essential to the evolution of the field. For periodic 3D structures, the 
diffraction of electrons or X-rays can be used to determine structure in recip- 
rocal space, which can then be inverted to find the real-space atomic arrange- 
ments, as discussed in Chapter 2. For individual nanoscale solids, diffraction is 
only of limited utility for both fundamental and practical reasons. The solid’s 
small size interrupts the periodicity of the lattice, blurring diffraction peaks, 
and also produces a very small scattered signal. 

Real-space probes that can directly determine the properties of the nano- 
structure are therefore very valuable. These probes use the interaction of a 
particle, typically an electron or photon, with the object under study, to create 
an image. The techniques fall into two major classes, which we will refer to as 
focal and scanned probe. 

In focal microscopy, the probe particle is focused by a series of lenses onto 
the sample. Figure 4 shows a schematic. The ultimate resolution of the system 
is limited by the wavelike nature of the particle through the Heisenberg Un- 
certainty Principle, or, equivalently, diffraction. This smallest feature spacing 
d that can be resolved is given by 

A/2/3 , (2) 

where A is the wavelength of the probe particle and /3 — sin 6 is the numerical 
aperture defined in Fig. 4. Achieving nanoscale resolution requires using par- 
ticles with small wavelengths and maximizing the numerical aperture. 
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Figure 4 Schematic diagram of a focal mi- 
croscope. A beam emitted from a source is 
focused onto the sample by a series of lenses. 
An equivalent focal system can be used to 
focus particle/waves emitted from the sample 
onto a detector. 



In scanned probe microscopy, by contrast, a tiny probe is brought close to 
the sample and scanned over its surface. The resolution of the microscope is 
determined by the effective range of the interaction between the probe and 
the structure under study, rather than by the wavelength of the probe particle. 

In addition to imaging, scanned and focal probes provide information 
about the electrical, vibrational, optical, and magnetic properties of individual 
nanostructures. Of particular importance is the electronic structure, expressed 
in the density of states. For a finite-sized system, the density of states is a se- 
ries of delta functions 

D(e) = 2S(s - Sj ) , ( 3 ) 

j 

where the sum is taken over all the energy eigenstates of the system. For 
extended solids, the density of states can be represented by a continuous func- 
tion, but for a nanostructure the discrete sum form is necessary along the con- 
fined directions. This quantized density of states determines many of the most 
important properties of nanostructures, and it can be directly measured using 
the techniques described below. 

Electron Microscopy 

A very powerful focal tool is the electron microscope. A collimated beam 
of electrons is accelerated by high voltages and focused through a series of 
electrostatic or magnetic lenses onto the sample under study. 

In transmission electron microscopy, or TEM, the electron beam trav- 
els through the sample and is focused on a detector plate in much the same 
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way as the image is focused onto the eyepiece of an optical microscope. The ul- 
timate resolving power d is set by the wavelength of the accelerated electrons 

d = A/2y3 = 0.6nm/(/3Vv) , (4) 

where V is the accelerating voltage (measured in volts). For typical accelerat- 
ing voltages (100 kV), the theoretical resolving power is therefore subatomic. 
Other effects, such as imperfections in the lenses, keep the TEM resolution well 
above this limit, but ~ 0. 1 nm has been achieved. Figure 2 shows a TEM 
image of a semiconductor nanocrystal, where rows of atoms are clearly resolved. 

A major limitation of TEM is that the electron beam must penetrate the 
sample, making it impossible to examine structures on solid substrates. This 
problem is overcome in the scanning electron microscope (SEM). In an 
SEM, a high-energy (100 V to 100 kV), tightly focused electron beam is scanned 
over the sample. The number of backscattered electrons and/or the secondary 
electrons generated by the beam that emerge from the sample depends on the 
local composition and topography of the sample. These electrons are collected by 
an electron detector, and an image is formed by plotting this detector signal as a 
function of the beam location. This powerful technique can be used on most 
kinds of samples, but it typically has a lower resolution (>1 nm) than the TEM. 
Figure 1 is an SEM image of metallic electrodes on a GaAs/AlGaAs substrate. 

In addition to imaging, the SEM beam can be used to expose an electron- 
sensitive material and draw small features in a technique known as electron 
beam lithography. The ultimate resolution (<10 nm) is very high, but it is a 
slow process because the patterns must be drawn pixel by pixel. It is therefore 
used primarily in research, prototyping, and optical mask fabrication. 

Optical Microscopy 

The optical microscope is the prototypical focal instrument. Using visi- 
ble light and a high numerical aperture (/3 == 1), the highest obtainable resolu- 
tion is 200-400 nm. For direct imaging, optical microscopy therefore only 
reaches the edge of the nanoscale realm. However, many of the optical spectro- 
scopies discussed in Chapter 15 have been successfully adapted to study 
individual nanostructures. These include elastic light scattering, absorption, 
luminescence, and Raman scattering. Measurements of a single nanostructure, 
or even a single molecule, are possible if only one is in the field of view of the 
microscope. 

Here we briefly review the emission and absorption of electromagnetic 
radiation by matter in a manner suitable for applications to nanostructures. 
Within the electric dipole approximation, Fermi s golden rule gives the transition 
rate between an initial state t and a higher energy state j due to absorption: 

u ;, = (27r/ft)[(j'|eE , r|i)| 8(ej — e ; — hw) . 


( 5 ) 
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Transitions therefore occur between states that have a nonzero dipole matrix 
element and whose energies differ by the absorbed photon energy hw. Simi- 
larly, the emission rate from state / to i is given by 

= (2-7r/ft)|{j|f?E-r|i)| 8(e i - £, + ha>) + (4aw|/c 2 )|{jjr|j}| , (6) 

where cUj f = (e f — e t )/h and a is the fine structure constant. The first and sec- 
ond terms represent stimulated and spontaneous emission, respectively. 

By summing over all possible states, these relations can be used to calcu- 
late the total power a ' E 2 absorbed from the electromagnetic field and hence 
the real part of the conductivity: 

cr'(w) = (WW V)X|0'|“' r l i )| L/fe) ~fl s j)] s ( s j ~ s i ~ *“>) > (7) 

v 

where n is a unit vector pointing in the direction of the electric field. The 
absorption is proportional to the joint density of all initial and final states sepa- 
rated by an energy hw, weighted by the dipole matrix element and the occupa- 
tion factors of the states. The Fermi functions indicate that absorption only 
occurs when the initial state i is filled and the final state / is empty. 

The above relations show that absorption and emission can be used to 
probe the electronic energy level spectra of nanostructures. Measurements can 
readily be performed on macroscopic collections of nominally identical nanos- 
tructures, but the effects of inhomogenous broadening due to the variation in 
the properties of the individual nanostructures are significant. Furthermore, 
sometimes only a few or even a single nanostructure is available for measure- 
ment. Optical measurements that probe single nanostructures have therefore 
proven to be particularly valuable. 

Figure 5 shows an example of spontaneous emission, or fluorescence, from 
individual optically excited semiconductor quantum dots. The emission occurs 
from the lowest energy state, in the conduction band to the highest energy state 
in the valence band. The linewidths of the emission lines from single nanocrys- 
tals are very narrow, but they are distributed over a range of energies due to 
variations in the nanocrystal size, shape, and local environment. Measurements 
of an ensemble therefore show a broad peak that does not accurately reflect the 
properties of a single nanocrystal. 

In addition to their use in probing nanostructures, optical focal systems 
arc also widely used for microfabrication. In projection photolithography, a 
pattern on a mask is projected onto a photosensitive resist using optical 
elements. Following exposure and development of the resist, the pattern is 
transferred into the material of interest by etching or deposition through the 
resist stencil. Optical lithography is the basis for the mass-fabrication of 
microelectronic and micromechanical systems. By using wavelengths into the 
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Figure 5 Left: Image of the fluorescence from individual CdSe nanocrystals dilutely distributed 
on a surface at T = 10 K. Right: Spectra of the fluorescence of a number of different individual 
nanocrystals. In each spectrum, the high-energy peak is the primary transition between the lowest 
electronic state in the conduction band and the highest energy state in the valence band. The 
lower energy peaks are associated transitions involving the emission of an LO phonon. Variations 
in the nanocrystal size and local electronic environment shift the positions of the peaks. The broad 
peak is the spectrum obtained for an ensemble of nominally identical nanocrystals. (After 
S. Empedocles et al.) 


deep-UV, devices with features of 100 nm are in commercial production. Fur- 
ther improvements using Extreme UV light or even X-rays are possible, but 
the masks and focusing elements become more and more challenging to fabri- 
cate and control. 

Scanning Tunneling Microscopy 

The most famous scanned probe instrument is the scanning tunneling 
microscope (STM), schematically shown in Fig. 6. Its invention was a break- 
through in the field of nanoscience. In an STM, a sharp metal tip, preferably 
one with a single atom protruding from the end, is brought to within a 
nanometer of the conducting sample to be studied. The position of the tip is 
controlled with picometer precision using piezoelectric materials that expand 
or contract in response to electrical signals from a control system. A voltage 
bias V is applied to the sample, and a tunneling current I flowing between 
the tip and the sample is measured. The current is proportional to 3, the 
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(a) 



(b) 


Figure 6 Schematic of a scanning tunneling microscope (STM). When operated in feedback 
mode, the piezos scan the tip over the sample and maintain a constant tunneling current between 
the tip and the sample. (Courtesy of D. LePage.) Lower: STM image of a carbon nanotube. (Cour- 
tesy of C. Dekker.) 


tunneling probability through the gap between the tip and sample. The 
tunneling probability is exponentially sensitive to the tunneling distance. In 
the WKB approximation, 

3 exp(— 2\/ 2m<f>/fr z) , (8) 

where z is the distance between the tip and sample and </> is the effective bar- 
rier height for tunneling. For typical parameters, a 0.1-nm change in the tip 
position leads to an order of magnitude change in 3 . 

When the STM operates in feedback mode, I is maintained at a con- 
stant value by changing the tip height z. The STM thus tracks the surface 
topography, and very' small changes in the height of the surface can be detected 



18 Nanostructures 


525 



Figure 7 A “quantum corral” of mean radius 7.1 nm was formed by moving 48 Fe atoms on a Cu 
(111) surface. The Fe atoms scatter the surface state electrons, confining them to the interior of 
the corral. The rings in the corral are the density distribution of the electrons in the three quan- 
tum states of the corral that lie close to the Fermi energy. The atoms were imaged and moved into 
position by a low-temperature, ultra-high vacuum scanning tunneling microscope. (Image cour- 
tesy of D. M. Eigler, IBM Research Division.) 


(<1 pm). This is illustrated in Fig. 6, where an STM image of a carbon 
nanotube is shown. The STM can also be used to manipulate individual atoms 
on a surface. An example is shown in Fig. 7, where the STM tip is used to con- 
struct a “quantum corral” by pushing Fe atoms on a Cu (111) surface into 
a ring. 

The STM tunneling current I as a function of bias V can give spatial and 
spectroscopic information about the quantum states of a nanostructure. At 
zero temperature, the derivative of the current with respect to voltage is 

dl/dV* SSk/r,)! 2 S(s F + eV -e } ) . (9) 

j 

It is proportional to the density of states at the tunneling electron energy 
e F + eV, weighted by the electron probability density of those states at the 
STM tip position r t . 

For the quantum corral, the electrons in the 2D surface state of Cu are re- 
flected by the Fe atoms, creating a discrete set of states in the interior of the 
corral. The observed ripples in the image in Fig. 7 are due to the modulations 
of the probability density l«/q(r t )l 2 of these localized states near the tunneling 
electron energy. Images at different bias voltages yield the spatial structure of 
quantized states at different energies. 
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Atomic Force Microscopy 

The atomic force microscope (AFM) was developed soon after the STM. 
It is a much more flexible technique than STM and can be used on both con- 
ducting and insulating samples. However, it typically has poorer resolution. An 
AFM measures the force between the tip and the sample, rather than the tun- 
neling current. A sharp tip is mounted on the end of a millimeter-sized can- 
tilever, as shown in Fig. 8. A force F exerted on the tip by the sample deflects 
the cantilever by A z: 

F = CAz , (10) 

where C is the force constant of the cantilever. The displacement of the can- 
tilever is measured as a function of tip position, often by using the back of 
the cantilever as a reflector for a laser beam (Fig. 8). Motion of the reflector 
changes the path of the laser beam, which is detected using a photodiode array; 
picometer-scale displacements can easily be measured. Since a typical value of 
the force constant is C = lN/m, pN-scale forces can be transduced. Forces well 
below 1 fN have been measured under special circumstances. 

The simplest mode of operation is contact mode, where the tip is dragged 
along in contact with the surface and the cantilever deflection is measured. 
This gives a measure of the sample topography, but it can damage the sample. 
Noncontact or intermittent-contact imaging modes are less invasive, and they 
also can give information about the long-range forces between the sample and 
the tip. In these techniques, the cantilever oscillates just above the sample 
due to an applied driving force of amplitude F 0J near the cantilever resonance 



Figure 8 (a) Schematic of an atomic force microscope (AFM). Deflections of the cantilever are 
measured by a photodetector registering the position of a laser beam that reflects off the top of 
the cantilever. (Courtesy of Joost Frenken.) Inset: SEM image of an AFM tip. The effective 
radius of curvature of the tip can be less than 10 nm. 
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frequency o> () . Modeling the cantilever as a driven simple harmonic oscillator, 
the magnitude of the cantilever response at a frequency cu is given by 

^ a>0 ( 11 ) 

C [(^-col)Z + (axoJQ)T ’ 

where Q, the quality' factor of the oscillator, is the ratio of the energy' stored 
in the cantilever to the energy dissipated per cycle. Note that on-resonance, 
( o — n> 0 , the response is Q times larger than at low frequencies, making the 
detection of small forces possible. 

The parameters characterizing the oscillating cantilever are sensitive to 
any forces that occur between the tip and the sample. These forces can be van 
der Waals, electrostatic, magnetic, or many others. The interaction shifts the 
resonance frequency &> Q and/or modifies Q. This change is recorded and used 
to construct an image. For example, in tapping mode imaging, the tip ‘taps’ the 
surface during the closest approach of the oscillation cycle, causing both a fre- 
quency' shift and additional dissipation. The nanotube device shown in Fig. 3 is 
imaged in tapping mode. 

Another important technique is Magnetic Force Microscopy (MFM), 
briefly discussed in Chapter 12. The tip is coated with a magnetic material 
so that it has a magnetic moment /x normal to the surface of the sample. It 
then feels a force due to variations in local magnetic fields produced by the 
sample 

F(z 0 + Az) = F(z 0 ) + dF/dz\ z=Zti Az = ix(dB/dz)\,^ + Md 2 £/dz 2 )| 3=;i(j Az , (12) 

where z„ is the tip’s equilibrium position and A z is the displacement during the 
oscillation. The term fx(dB/dz) produces a static deflection of the cantilever, 
but does not alter the oscillation frequency or the damping. The term 
n(d 2 B/dz 2 )Az , on the other hand, has the form of a force constant change SC, 
since it is linear in the displacement Az of the cantilever. It therefore shifts the 
resonance frequency of the cantilever. Monitoring this frequency shift 
produces an image. Gradients of other local force fields can be similarly 
measured. 

There are many other scanned probe techniques. Ncar-field scanning 
optical microscopy (NSOM) creates optical images with a resolution below 
the diffraction limit by using a scanned subwavelength aperture through 
which photons ‘tunnel’. Scanning capacitance microscopy (SCM) measures 
capacitance variations between the tip and the sample as a function of posi- 
tion. This ever-growing family of techniques is increasingly used to character- 
ize objects ranging from individual molecules to Si transistors in integrated 
circuits. 
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ELECTRONIC STRUCTURE OF ID SYSTEMS 


The quantized electronic states of nanostructures determine their electri- 
cal and optical properties, and they influence the physical and chemical prop- 
erties as well. To describe these states, we take as our starting point the band 
structure of the bulk material. An effective mass approximation is used for the 
electronic dispersion of a given band, and the associated wavefunctions are 
treated as plane waves. These are simplifications; the bands are not always par- 
abolic, and the true eigenstates are Bloch states, not plane waves. However, 
these assumptions greatly simplify the mathematics and are qualitatively (and 
often quantitatively) correct. We will also often neglect the Coulomb interac- 
tions between electrons. However, there are many cases in the physics of 
nanostructures where electron— electron interactions cannot be ignored, as dis- 
cussed later in this chapter. 

One-dimensional (ID) Subbands 

Consider a nanoscale solid in the geometry of a wire. Its dimensions along 
the x and y are nanoscale, but it is continuous in z. The energies and eigen- 
states of such a wire are given by 

e — £, ; + irk 1 / 2m ; iff(x,y,z) = i/fjj(x,y)e' k ~ , (13) 

where i and j are the quantum numbers labeling the eigenstates in the x,y 
plane and k is the wavevector in the z direction. For the rectangular wire 
shown in Fig. 9, e t j and if/ t j (x,y) are just particle-in-a-box energies and eigen- 
states discussed in Chapter 6. 

The dispersion relation consists of a series of ID subbands, each corre- 
sponding to a different transverse energy state £y,. The total density of elec- 
tronic states D(e) is the sum of the density of states of the individual subbands: 


D(e) = 2 D iM) > (14) 

v 

where Djj(e) is given by 


dNj j JJ. r 


m 


2ft 2 (e 




4 L , 

- — tor e > e t 
hv i.i J 


(15) 


— 0 for s < Sj j . 

The first factor of two in the middle expression is due to spin degeneracy and 
the second from including both positive and negative values of k. In the right 
expression, v j f is the velocity of the electron in the i,j subhand with kinetic 
energy e—Sij. Note that the density' of states diverges as (c ~ £y) -1/2 at each 
subband threshold. These are called van Hove singularities. This behavior 
stands in contrast to three dimensions, where D( e) goes to zero at low energies 
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Figure 9 Schematic of a rectangular quasi-one-dimensional wire, along with the dispersion rela- 
tions and the density of states of the ID subbands. The peaks in the density of states at the sub- 
band thresholds are called Van Hove singularities. The probability density for the i = 2 ,j = 1 state 
is shown as a gray scale on the cross section of the wire. 

(Chapter 6), and two dimensions, where D(e) steps up a constant value at the 
bottom of each 2D subband (Prob. 17.3) 

Spectroscopy of Van Hove Singularities 

The Van Hove singularities described by (15) affect the electrical and opti- 
cal properties of ID systems. Here, we discuss the case of a semiconducting 
carbon nanotube, whose band structure is calculated in Prob. 1 and shown in 
Fig. 10a. Van Hove singularities are seen in scanning tunneling spectroscopy, 
as shown in Fig. 10b. Peaks in the differential conductance, which is propor- 
tional to the density of states by (9), are observed at bias voltages correspond- 
ing to the energies of these singularities. 

The optical absorption and emission of semiconducting nanotubes are also 
dominated by these singularities, since they depend on the initial and final 
density of states by (5)-(7). Figure 10c shows the photoluminescence intensity 
of a collection of carbon nanotubes as a function of the wavelength of the ex- 
citing and emitted light. The absorption of the incident light is enhanced when 
the energy matches s c2 — e v2 , the energy difference between the 2nd Van Hove 
singularities in the conductance and valence bands. The electrons and holes 
relax quickly to the bottom of the first subband, where they recombine, 
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(c) 

Figure 10 (a) The density of states for a semiconducting carbon nanotube as a function of 
energy. The Van Hove singularities are seen in the STM tunneling spectra of a nanotube shown in 
(b). In (c), the emission intensity is plotted as a function of the emission wavelength and the exci- 
tation wavelength. Peaks in the intensity are observed when the absorption and emission energies 
correspond to those shown in the diagram (a). Different peaks correspond to nanotubes with 
different radii and chirality. [After Bachilo et al. (a and c) and C. Dekker (b).] 
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producing luminescence with energy e cl — e vl . Peaks are therefore observed 
when the emitted and absorbed light simultaneously match the energies be- 
tween the 1st and 2nd van Hove singularities, respectively. Different peaks in 
the emission intensity in the plot correspond to nanotubes of varying diame- 
ters and chiralities. 

ID Metals — Coulomb Interactions and Lattice Couplings 

In a quasi-one-dimensional metal, electrons fill up individual ID sub- 
bands, with the Fermi energy and the total number of subbands occupied de- 
termined by the electron density. For a strictly ID metal, there is only one 
(spin-degenerate) subband occupied. In this case, if there are n w carriers per 
unit length: 

n w = 2k F /ir. (16) 

The Fermi surface of a ID metal consists of just two points, at +k F 
and — k F , as shown in Fig. 11. This is quite different from the Fermi surfaces 
in 3D and 2D free-electron metals, which consist of a sphere and a circle, 
respectively. Two consequences of this unusual Fermi surface are discussed 
below. 

Coulomb interactions cause scattering among electrons near the Fermi en- 
ergy. For 3D metals, scattering is strongly suppressed near e F by the restrictions 
of energy/momentum conservation combined with the Pauli exclusion principle. 
At an energy e measured relative to s F , 1/t,, ( , ~ (l/r 0 ) (e/e F ) 2 , where l/r 0 is the 




Figure 11 Electronic structure of a ID metal near the Fermi energy. The Fermi surface consists 
of two points at ± k F . The scattering of electrons from filled states 1 and 2 to empty states 3 and 4 
conserves energy as long as the energy difference is the same between 1 and 3 and between 2 and 
4. Momentum is simultaneously conserved because the energy is locally linear in k. 
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classical scattering rate. By the uncertainty principle, this produces an uncer- 
tainty in the energy of the electron: 

8s(3D) ~ h/T e „ ~ ( A/ToXs/sp f . (17) 

As the energy becomes small (measured relative to the uncertainty in the 
energy goes to zero as the second power of s. The uncertainty 8s is therefore 
guaranteed to be small in comparison to s sufficiently close to s F . This ensures 
that the quasiparticles near the Fermi surface arc well defined. 

The case of one dimension is shown in Fig. 11. Energy and momentum 
conservation are equivalent in this case since for small s the energy is locally 
linear in the momentum change Afc = k~k F : 

e ££ (fi 2 k p /m)Ak . (18) 

Referring to Fig. 11, energy conservation requires that for an electron in state 
1 at energy s to scatter to state 3, a simultaneous scattering of an electron from 
state 2 to 4 must occur. The only restriction is that the final state energy 3 be 
positive and less than e. This gives a reduction factor \/r ee ~ (1/t 0 ) (s/e F ), and, 
from the uncertainty principle, 

8e(lD) — h /r„ e ~ ( h /t 0 )(s/s f ) . (19) 

Since the uncertainty' is linear in e, there is no guarantee that 8s will be 
smaller than e as s — » 0. The fundamental assumption behind Fermi liquid 
theory', that weakly interacting quasiparticles exist as s — > 0, is therefore not 
guaranteed in ID. In fact, the ground state of the interacting ID electron gas 
is believed not to be a Fermi liquid, but rather a Luttingcr liquid whose low- 
energy' excitations are collective in nature. The excitations are more analogous 
to phonons or plasmons — a collective motion of many objects — than isolated 
electrons moving independently of their neighbors. This collective nature has 
a number of effects. For example, tunneling into a ID metal is suppressed at 
low energies because the tunneling electron must excite the collective modes. 
In spite of this issue, the independent electron picture remains a useful ap- 
proximation for the ID electron gas. It has been successful in describing most, 
but not all, experiments on real ID systems, and will be adopted below. 

A second unusual property of ID metals is that they are unstable to per- 
turbations at a w r avevector 2 k P . For example, a distortion of the lattice at this 
w'avevector wall open up a bandgap in the electronic spectrum, converting the 
metal to an insulator. This is the Peierls instability treated in detail in Chapter 
14. This effect is particularly important in ID conducting polymers such as 
polyacetylene (Fig. 12). It has one conduction electron per carbon atom spac- 
ing a, and therefore from (16) k F = ir!2a Without any distortions, polyacety- 
lene would have a half-filled band and be a metal. However, a lattice distortion 
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Figure 12 Structure of polyacetylene. Due to the Peierls distortion, the lattice is dimerized, with 
carbon atoms joined by double bonds in the diagram closer together than those linked by single 
bonds. The Peierls distortion opens a semiconducting gap of approximately 1.5 eV. 


at 2k F = ir/a, corresponding to a wavelength 2a, opens up a gap at the Fermi 
energy. This corresponds to a dimerization of the lattice. This dimerization 
produces alternating single and double bonds along the chain and turns poly- 
acetylene into a semiconductor with a bandgap of 1.5 eV. 

Polyacetylene and related semiconducting polymers can be made into 
field-effect transistors, light-emitting diodes, and other semiconducting de- 
vices. They can also be doped chemically, producing metallic behavior with 
conductivities comparable to traditional metals. However, they retain the me- 
chanical flexibility and ease of processing characteristic of polymers. Their dis- 
covery has led to a revolution in flexible plastic electronics. 

The Peierls distortion is large in polymers because their backbone consists 
of a single atomic chain, which can easily distort. Other ID systems such as 
nanotubes and nanowires are much .stiffer, and the Peierls transition is not ob- 
served at experimentally relevant temperatures. 


ELECTRICAL TRANSPORT IN ID 


Conductance Quantization and the Landauer Formula 

A ID channel has a finite current-carrying capacity for a given voltage ap- 
plied across its ends. It therefore has a finite conductance even if there is no 
scattering in the wire. Consider a wire with one subband occupied connecting 
two larger reservoirs with a voltage difference V between them, as shown in 
Fig. 13. The right-going states will be populated up to an electrochemical po- 
tential jiii and left-going states will he populated up to /x 2 , where — ju, 2 = qV 
and q = — e for electrons and +e for holes. The net current flowing through 
the channel due to the excess right-moving carrier density An is then 


I = A nqv — 


D B (e)qV 
L qV = 




( 20 ) 


where D R (e), the density of states of right-moving carriers, is % the total den- 
sity of states given in (15). 
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Figure 13 (a) The net current propagating between two reservoirs for an applied bias voltage dif- 
ference Vi — V 2 . (b) Schematic representation of transmission probability 3 and reflection proba- 
bility 31 from a barrier in the channel, where 3 + 91 = 1. 


Remarkably, in ID the velocity exactly cancels with the density of states to 
create a current that depends only on the voltages and fundamental constants. 
The two-terminal conductance I/V and resistance V/I are then 

G g = 2 e 2 /h ; R g = h/2e 2 = 12.906 kfl . (21) 

A perfectly transmitting one-dimensional channel has a finite conductance 
whose value is the ratio of fundamental constants. This is called the conduc- 
tance quantum Gg; its inverse is called the resistance quantum Rg. While 
derived here in the effective mass approximation, it is true for a ID band of 
arbitrary dispersion. 

The quantization of conductance is dramatically illustrated in the data in 
Fig. 14. A short quasi- ID channel is formed between two regions of a 2D elec- 
tron gas in a GaAs/AlGaAs heterostructure. As the carrier density of the chan- 
nel is increased, the conductance increases in discrete steps of height 2 e 2 /h. 
Each step corresponds to the occupancy of an additional ID subband in the 
wire. Conductance quantization is also observed in atomic-scale bridges be- 
tween macroscopic metals. 

If the channel is not perfectly conducting, the overall conductance is the 
quantum of conductance times the probability 3 (e F ) for electron transmission 
through the channel (Fig. 13): 

G(s F ) = (2e 2 /h)3(s F ) . 


( 22 ) 



18 Nanostructures 


535 




Figure 14 Conductance quantization in a short channel electrostatically defined in a GaAs/ 
AlGaAs heterostructure at different temperatures. A negative gate voltage V g applied to the metal- 
lic gates on the surface of the sample depletes the carriers in the underlying two-dimensional 
electron gas, creating a narrow channel. The channel is fully depleted of carriers at V g = —2.1V. 
Individual ID subbands become occupied with increasing V K , with each new subband adding a 
conductance of 2e~/h . (Courtesy of H. Van Houten and C. Beenakker.) 


This equation is often called the Landauer Formula. For a quasi-ID system 
with multiple channels, we sum over the contributions of each channel, since 
conductances in parallel add: 


3(* f ) = 2 3,, (* F ) , (23) 

V 

where i, j label the transverse eigenstates. For example, for N perfectly 
transmitted channels in parallel, 3 = IV, as for the data in Fig. 14. 


At finite temperatures or biases, the Fermi-Dirac energy distributions/ of 
the electrons in the left and right leads must be taken into account: 

co 

1(s f ,V,T ) = (2 e/h) j ds\f(s - eV) —/ («)] 3(«) . (24) 

— co 

The net current is simply the difference between the left- and right-moving 
currents, integrated over all energies. 

The Landauer formula (22) directly relates the resistance of a system to the 
transmission properties of the channel. Let us rewrite the resistance for the 
one-channel case in the following way: 


h 1 _ h 3+(l-3) _ h | h 91 

2e 2 3 2e z 3 2e 2 2e 2 3 ’ 


(25) 


where 31 = 1 — 3 is the reflection coefficient. The resistance of the derice is 
the sum of the first term, the quantized contact resistance, and the second term, 
the resistance due to scattering from barriers in the channel. The latter term is 
zero for a perfect conductor. Below we consider an application of the Landauer 
formula to the problem of two barriers in series. We treat this in both the coher- 
ent and incoherent limits of electron propagation between the barriers. 


Two Barriers in Series-Resonant Tunneling 

Consider two barriers in series separated by a distance L, with transmis- 
sion/reflection amplitudes t u r, and t 2 , r 2 , as shown in Fig. 15. These ampli- 
tudes are complex: 

tj = \tj\e'^ ; rj = \rj\e‘ <p, j . (26) 

To calculate the transmission probability' 3 through the entire double barrier 
structure, we need the corresponding transmission amplitude. For an incident 
wave from the left whose amplitude is 1, the amplitudes defined in Fig. 15 arc 
given by 

a = ti + rj? ; b = ar 2 e“ e ; c = at. 1 e l ' f ' 2 , (27) 

where tp — 2kL is the phase that an electron with kinetic energy' h“lcl2m accu- 
mulates propagating the distance 2 L on a round trip between the barriers. 
Combining these to solve for the transmitted amplitude yields: 


1 —r 1 r 2 e“ f 

The transmission probability through the double barrier is then 


(28) 


3 - 


r 2 


c - 


1+ h Pl r 2 P - 2 l r l ll r 2 |cOs(<p*) 


(29) 


where <p“ = 2 kL + <p rl + <p r2 . 
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L 

Figure 15 Resonant tunneling through two identical barriers in series separated by a length L. 
The upper diagram shows the transmission amplitudes between and outside the barriers for a 
unity amplitude incident wave. The transmission resonances at the energies of the quasibound 
states between the barriers are shown. 


This is plotted in Fig. 15. Note the round-trip phase accumulation <p* includes 
the phase shifts associated with reflections from the barriers. 

The transmission probability (29) is greatly enhanced when cos (<p*) ap- 
proaches unity, because the denominator becomes small. This occurs for the 
resonance condition 


tp* = 2kL + <p rl +(Pr2 — 2,7171 , (30) 

where n is an integer. This is a general property of waves, and is due to the 
constructive interference of many pathways through the sample. This can be 
easily seen by rewriting (28) using the series expansion 1/(1 — x) — ^ x m : 

m—0 

c = = t 1 t 2 e iip ' 2 [l+r 1 r 2 e i ‘ f '+ (r 1 r 2 e ,> f + ] . (31) 

The mth order in the expansion corresponds to a path with m round trips be- 
tween the barriers. On resonance, these paths add in phase to yield a strongly 
enhanced transmission. 

Consider the special case where the barriers are the same: t x = t 2 . We 
then have 


3(<p* = 27rn) =|ti| 4 (l-|r 1 | 2 ) 2 =1 . (32) 

The transmission on resonance through a symmetric double-barrier structure 
is 1, even if the transmission through each of the individual barriers is small. 
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This is called resonant tunneling. Off resonance, the denominator of (29) is of 
order unity for opaque barriers, and the transmission is roughly the product of 
the transmission coefficients of each of the two barriers in series: 3 ~ |t|| 2 |t s | 2 . 

The resonance condition <p* = 2 mi corresponds to the energies of the 
quasibound electronic states confined between the two barriers. For very 
opaque walls, this is just the particle-in-a-box quantization condition: kL = mi. 
We derived the resonant tunneling condition for a one-dimensional case, but it 
is a general result. The transmission through a confined electron system is 
strongly enhanced at energies corresponding to the hound-state energy levels 
of the confined electrons. This is also evident from the STM tunneling expres- 
sion (9); quasibound states produce peaks in the differential conductance. 

For the case of opaque barriers, lt : l 2 ,l# 2 l 2 <§1, the cosine term in the de- 
nominator of (29) can be expanded, as shown in Prob. 3, yielding the familiar 
Breit-Wigner form for a resonance: 


3 (*) = 


4T,r, 


OVT.) 2 + 4(e -ej 


chore 


r = — h. i 2 

’ 277 't' 


(33) 


The resonances are thus Lorentzian peaks with a width in energy of T = 
r x + r 2 determined by the energy level spacing As and the transmission prob- 
abilities through the two barriers. This is just the uncertainty principle broad- 
ening of the level due to the finite lifetime of the double-barrier bound state. 

Incoherent Addition and Ohm’s Law 

If we instead treat the electron classically, we add probabilities rather than 
amplitudes. This is valid if the electron effectively loses track of its phase be- 
tween the barriers due to, for example, inelastic scattering from phonons. This 
corresponds to replacing (27) by 

l«P= |*iP + kiNM 2 ; |h | 2 = ] a P| r 2 | 2 ; | c | 2 = | a | 2 | t 2 | 2 . (34) 


This gives 


Some elementary manipulations (Prob. 4) yield 

R = (h/2e 2 )(l + |r 1 | 2 / pip+ |r 2 p/|f 2 p) 

The resistance is just the sum of the quantized contact resistance and the 
intrinsic resistances of the individual barriers (see Eq. 25). This is Ohms law — 
resistors in series add. It is valid if interference effects can be neglected. 

Equation (36) allows us to connect to the Drude formula. Consider a process 
that gives a backscattering rate l/r h . This backscattering could result from either 
an elastic scattering process such as impurity scattering or from an inelastic 


(35) 

(36) 
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process such as phonon scattering. For propagation over a small distance dL, the 
reflection probability <797 (<S 1) is 


<79t = 


UjL 
T b V F 


dL 

4 


(37) 


This gives a contribution to the resistance, yielding a resistivity: 

p 1D — dR ! dL = (h / 2e 2 ) i . (38) 

This is equal to the ID Drude resistance <r^j = (n lD e 2 T/m) \ as shown in 
Prob. 4. Ignoring interference effects, the resistances of individual segments 
add ohmically, giving 

R = R Q + (h/2e 2 )(L/e b ) . (39) 


Localization 

Now consider when two barriers are connected in series, but coherence is 
not neglected. However, we average over all possible phases, corresponding to 
an average over different energies. From Eq. (29), the average resistance is 

h 1+ l r il 2 l r 2l 2 - 2 killDl(cosy > ) _ h l + hl 2 hl 2 

2e 2 \h\%\ 2 2e 2 \t.\%\ 2 ' (40) 



Notably, the phase-averaged resistance (40) is larger than the resistance in the 
incoherent limit (36). 

To understand the scaling with length associated with (40), consider a long 
conductor of length L consisting of a series ol only elastic (phase-preserving) 
scatterers characterized by an elastic backscattering length € t „ Assume that the 
conductor has a large resistance (R), so that 9? = 1 and 3 1. For a small 

additional length dL, there will be an additional reflection and transmission 
d9t = dL/t e , as in (37), and <73 = 1 — </3f Combining these according to the 
prescription of (40), and assuming that c79l 1, gives 

or equivalently, 

(dR) = (R)(2dL/Q . (42) 

Separating variables and integrating both sides of the equation yields 

(r) = (h / 2e s ) exp(2L / € e ) . (43) 

Remarkably, the resistance grows exponentially with the length of the sample, 
rather than linearly as in an ohmic conductor. This behavior is a result of localiza- 
tion. Due to quantum interference among the states scattered by disorder, the 
states become localized on a size scale £ ~ f c , where £ is called the localization 
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length. There are no extended states that traverse the entire length of the con- 
ductor, so the resistance is exponentially large. A similar result holds for quasi-ID 
systems, hut with a localization length £ ~ N£ e , where N is the number of ID 
subbands occupied. 

At very low temperatures, only coherent scattering processes occur and 
the resistance is exponentially large by (43). At finite temperatures, electrons 
retain their phase memory’ only over the phase coherence length i due to their 
interaction with other degrees of freedom such as phonons or electrons. This 
length typically is a power-law function of temperature, t p — AT a . since the 
number of electronic and vibrational excitations present is a power law in T. The 
resistance of each phase coherent segment can be approximated by (43) with £ v 
replacing L. The resistance of each phase coherent segment decreases rapidly 
with increasing temperature (as the exponential of a power law in T ). This 
dramatically decreases the overall resistance, which is the (incoherent) series 
combination of L!£ v such phase coherent sections. At a sufficiently high tem- 
perature where — £ e , all phase coherence is lost between scattering events 
and the ohmic expression (36) is applicable. 

A related issue is the nature of the electronic states in 2D and 3D systems 
in the presence of disorder. In 2D, it is believed that, for noniiiteracting elec- 
trons, all states are also localized by disorder. In 3D, on the other hand, a criti- 
cal amount of disorder is required to localize the states. The subject of local- 
ization continues to be of great fundamental interest and controversy, 
particularly when the effects of coulomb interactions between the electrons 
are included. 

Voltage Probes and the Biittiker-Landauer Formalism 

In many measurements, more than two probes are connected to a conduc- 
tor. Some are used as voltage probes (which draw no net current from the 
sample) and others as current probes, as shown in Fig. 16. Biittiker extended 
the Landauer formalism to deal with this multiprobe case. Define as 

the total transmission probability for an electron leaving contact m to arrive 
at contact n, including the contributions from all the ID channels. For a 
current probe n with N n channels, the electrochemical potential of the contact 
is fixed by an applied voltage, and the net current that flows through the con- 
tact is 


h = (2 e 2 /h)(N n V n -^^VJ . (44) 

m 

This is just the current flowing out of the contact minus the currents flowing in 
that originated from each of the other contacts. Note that N„ - J J Z‘ n ' n K which 

7l\ 

can be easily obtained from (44) by considering the equilibrium case where all 
the voltages are equal and all the currents are zero. 
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Figure 16 Schematic representation of a multiterminal conductor. Contacts 1 and 2 are current 
probes; contact 3 is a voltage probe. The transmission probability from contact 1 to 2 and from 
1 to 3 is schematically indicated. 


For a voltage probe, the potential V„ adjusts itself so that no net current 
flows (/„ = 0): 


V = 


2 3 ( " ,m) V m 



m^n 


(45) 


The electrochemical potential measured by the probe is the weighted average 
of the electrochemical potentials of the different contacts, where the weight- 
ing coefficients are the transmission probabilities. 

Equations (44-45) have a number of surprising consequences. Since the 
measured currents and voltages depend on 3 <n ' m) , the details of the path that 
an electron takes in traversing the sample influences the resistance. A voltage 
probe can disturb the paths, and the measured voltage can in turn be affected 
by transmission through all parts of the sample. Below we present three exam- 
ples that illustrate these properties. 

Consider a voltage probe connected to the center of an otherwise ballistic 
ID conductor, as shown in Fig. 16. Assume that electrons leaving from probe 1 
either arrive at probe 2 or 3, but none are directly backscattered. The voltage 
read by probe 3 is then 


3 (3,1) V 

~ 3 (3,1) + 3 (3,2) 


V 

2 


(46) 


where for the last step we assumed that the voltage probe couples symmetri- 
cally to the left and right moving channels, 3 (3,1) = 3 (3,2) . The voltage mea- 
sured in the channel is just the average of the voltage of the two contacts. 

The current flowing out of contact 1 is given by: 

I = (2 e 2 /h)(V -3 (1,3, V 3 ) = (2e 2 /h)V(l - |3 (1>3) ) , (47) 


where (46) has been employed in the second step. Note that the presence of 
the voltage probe decreases the transmission below the unity value of a perfect 
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Figure 17 Four-terminal Hall resistance measurements of submicron junctions of different 
shapes. In the junction shown schematically in the upper left, the Hall resistance is negative at 
small B and positive at large B. The reason is indicated in the diagram; at small B, the electrons 
bounce off the wall into the “wrong” probe. (After C. Ford et al.) 

channel. Some of the electrons scatter into the voltage probe, are re-emitted, 
and then return to contact 1 . This shows that voltage probes are in general in- 
vasive; they influence what they measure unless they only couple very weakly 
to the system. 

Figure 17 shows a measurement of the Hall resistance of two nanoscale 
crosses patterned in a high mobility 2D electron gas whose geometries are 
shown in the insets. The junction region is ballistic, meaning that there is no 
scattering from disorder, only from the sample walls. The measured Hall resis- 
tance is not of the form B/n s e, where n s is the sheet carrier concentration, as 
expected for a macroscopic 2D electron gas, but has instead a number of 
notable features. Most surprisingly, the Hall voltage is of the opposite sign at 
low B compared to high B for the sample shown in the upper left inset. This 
can be easily understood from the shape of the classical electron paths 
sketched on the figure. At high B, the Lorentz force preferentially deflects the 
electron into the upper electrode, giving the expected sign of the Hall voltage. 
At low B, however, the electron bounces off the boundary of the conductor 
and arrives at the lower electrode, reversing the sign of the measured Hall 
voltage. For a small multiprobe conductor, the resistance is a measure of the 
electron trajectories through the sample and not simply related to intrinsic 
material properties like the electron density. 

Equations (44-45) can be used to treat arbitrarily complex microscopic (or 
even macroscopic) conductors. It has been widely used to describe measure- 
ments on small disordered metal samples at low temperatures as a function of 
magnetic field B. These samples have many transverse channels and contain 
impurities. The elastic scattering length ( e is less than the sample dimensions, 
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but the phase coherence length is greater. Electrons therefore propagate 
diffusively, but phase-coherently, through the sample. This is called the meso- 
scopic regime. In a semiclassical picture, the transmission amplitude between 
two probes n and m corresponds to the sum of many different classical paths 
through the sample: 

(CGS) t M ex P\ «/*) [ (P ~eA/c) • dl 

i \ 

m \ 

(p — eA)*dlj . (48) 

Note that the phase associated with each path amplitude ctj contains a contri- 
bution from the magnetic vector potential A, as described in Appendix G. 
Since 3 <n ' m) = lt (n m) l z , quantum interference among different conducting path- 
ways through the sample modulates the transmission. 

An interesting example is shown in Fig. 18. On the left, the four-terminal 
resistance of a nanoscale metallic wire is shown. Aperiodic fluctuations are 
seen in the conductance versus magnetic field B. These fluctuations are due to 
modulations of the interference between the many diffusive paths linking the 
contacts. Since there are many paths, the result is an essentially random varia- 
tion. These modulations are referred to as conductance fluctuations. 

When an additional loop is added that is outside the region between the con- 
tacts, as shown on the right of Fig. 18, the conductivity G qualitatively changes. A 
periodic modulation with magnetic field is seen. This is due to the Aharonov- 
Bohm effect. The vector potential modulates the quantum interference between 
those electron paths that encircle the ring and those that do not. For simplicity, 
consider the interference between just two such paths with transmission ampli- 
tudes a } and a 2 (Fig. 18) in the absence of a magnetic field. At finite B, 

(CGS) l fl i + a 2 exp([(ie/hc) <j> A-dl)] p = |a x | 2 + |a 2 | 2 +2|a,||a 2 | cos[2tt<I> /(hcle)] ; 

loop 

(SI) |«! + a 2 exp[(te/ft) A-dl] p = la^ 2 + |a 2 | 2 +2|aJ|a 2 | cos[2tt 4> /(/t/e)] . (49) 

loop 

In the last step we have employed Stokes’s theorem, where <P is the magnetic 
flux going through the loop and hc/e is the magnetic flux quantum (h/e in SI). 
With increasing flux, the transmission though the wire oscillates with the pe- 
riod of one flux quantum. This effect is closely related to the superconducting 
flux quantization discussed in Chapter 10, except that the charge appearing in 
the flux quantum here is e, not 2c, since the carriers here are electrons, not 
Cooper pairs. 
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Figure 18 Upper: SEM micrographs of two vertical Au wires with current and voltage probes at- 
tached. In the device on the right, an extra loop has been added outside the region between the 
probes. The diagram to the right shows two paths, one that encircles the ring and one that does 
not. Lower left: Conductance versus magnetic field for the left sample. Aperiodic conductance 
fluctuations are seen due to quantum interference between the conducting paths through the 
sample. Lower right: Periodic oscillations are observed associated with the Aharonov-Bohm effect 
for paths enclosing the loop nominally outside the region between the contacts, showing the non- 
local nature of diffusive coherent transport in mesoscopic systems. (After R. Webb. ) 


It is remarkable that the addition of a loop outside of the region between 
the voltage contacts changes the measured properties. Resistances in the 
mesoscopic regime are nonlocal. Electrons coherently diffuse throughout the 
entire sample while journeying between contacts, and their phase remembers 
the journey. 
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ELECTRONIC STRUCTURE OF OD SYSTEMS 
Quantized Energy Levels 

A system of electrons fully confined in all three dimensions will have dis- 
crete charge and electronic states, as do atoms and molecules. They are often 
called artificial atoms or quantum dots to reflect the importance of quantiza- 
tion phenomena on their properties. 

As a simple example, consider an electron in a spherical potential well. 
Due to the spherical symmetry, the Hamitonian separates into angular and ra- 
dial parts, giving eigenstates and eigen energies: 

£ n .i,m = e„ ,i ; <P(r,0,<p) = Y lm (6,<t>)R nl (r) , (50) 

where are the spherical harmonics and R n j(r) are the radial wave- 

functions. The energy levels and radial wavefu notions depend on the details of 
the particular confining potential. For an infinite spherical w'ell, where V = 0 
for r < R and is infinite otherwise, 

s„j = ft 2 /S^/(2 m*R 2 ) , 

Ki(r)=ji(P nJ r/R) , r<R . (51) 

The function j[(x) is the 1th spherical Bessel function and the coefficient p nt 
is the nth zero of ji(x). For example, /3 00 = 77 (IS), j3 01 — 4.5 (IP), Po i2 = 5.8 
(ID), /3 10 = 2v (2S), and p lx = 7.7 (2P). The labels in parentheses arc the 
atomic notations for the states, which have the usual degeneracies associated 
with spin and angular momentum orientation. 

Semiconductor Nanocrystals 

A semiconductor nanocrystal such as the one shown in Fig. 2 can, to a 
good approximation, be described by the spherical model given above. Both 
the electron states in the conduction band and the hole states in the valence 
band are quantized. For a CdSe nanoparticle, the conduction band effective 
mass m) = 0.13 m , and the electron energy levels arc e nl = (2.9 eV/R 2 ) 
(Pn/Rofi) 2 , where R, the radius of the nanoparticle, is expressed in nanometers. 
For R ~ 2 nm, the spacing between the lowest two energy levels is 
eo,i-«o,o= 0.76 eV. 

The IS electron state increases in energy with decreasing R, while the IS 
hole state decreases. The bandgap therefore grows and can be tuned over a 
wide range by changing R. This is shown in Fig. 19, where the absorption 
spectra of CdSc nanocrystals of different sizes are presented. For the smallest 
radii, the threshold for absorption shifts by nearly 1 eV from its bulk value. A 
similar shift is seen in the emission spectrum. The optical spectra of nanocrys- 
tals can be tuned continuously across the visible spectrum, making them use- 
ful in applications from fluorescent labeling to light-emitting diodes. 
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Figure 19 Optical absorption spectra for a series of CdSe nanocrystal samples of different aver- 
age radii. The lowest transition energy in the smallest nanocrystal sample is shifted by nearly 1 eV 
from the bulk bandgap. Two dominant transitions are labeled. (Courtesy of A. P Alivisatos.) 


The absorption intensity in nanocrystals becomes concentrated at the spe- 
cific frequencies corresponding to the transitions between discrete states, as 
described by (7). An important result for the integrated absorption can be ob- 
tained from the Kramers-Kronig relations discussed in Chapter 15. From 
Eq. (15.11b), we have: 


o"(w) = ~P 


Ju s 2 - 


(*> 


ds 


or 


(52) 


At very high frequencies, o> — » oo s the electron s response is identical to that of 
a free electron. By (15.20), 

<j"((o) — ne 2 /mo> . (53) 

In addition, as oj — » <» j the frequency s in the denominator of (52) can be ne- 
glected. Combining (52) and (53) then gives 

j (r'(s)ds — ime 2 / 2m . (54) 

Jo 


A bulk semiconductor and a nanocrystal therefore have the same overall 
absorption per unit volume when integrated over all frequencies. It is distrib- 
uted very differently, however. The absorption spectrum of macroscopic 
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semiconductor is continuous, but in a nanocrystal it consists of a series of dis- 
crete transitions with very high absorption intensity at the transition frequen- 
cies. These strong transitions at particular frequencies have motivated re- 
searchers to create lasers that work on the quantized electronic transitions of 
quantum dots. 


Metallic Dots 

For small spherical metallic dots, such as alkali-metal clusters created in an 
atomic beam, the electrons in the conduction band fill up the quantized 
energy levels described by (50), as shown in Fig. 20a. These quantized levels af- 
fect the electrical and optical properties, and can even influence the stability of 
the dot. Small clusters can be analyzed by mass spectroscopy to determine the 
number of atoms in the cluster (Fig. 20b). Since there is one conduction elec- 
tron per atom in an alkali metal, this is also the number of electrons in the con- 
duction band. Large abundances are seen at certain “magic numbers” of atoms 
in the cluster. These result from the enhanced stability for clusters with filled 
electronic shells. For example, the 8-atom cluster peak corresponds to the fill- 
ing of the IS (n = 1, € = 0) and the IP (n = 1, € = 1) shells. These filled-shell 
clusters are analogous to chemically stable filled-shell atoms (the noble gases). 

For larger or irregularly shaped metallic dots, the shell structure is de- 
stroyed. The level spacing becomes small in comparison to the shifts in the 
levels due to shape imperfections, faceting of the crystals, or disorder. While 
the details of the level spectrum are difficult to predict, the average level spac- 
ing at the Fermi energy can be estimated using (6.21) as 

As = 1 /D(s f ) = 2e F /3N . (55) 


For a spherical Au nanoparticle with R = 2 nm, the average level spacing is 
As ~ 2 meV. This is much smaller than the spacing between the lowest states 
in the CdSe nanocrystal conduction band (0.76 eV) calculated earlier. Energy- 
level quantization effects are much more important in semiconductor dots 
than they are in metallic ones. This is because the energy-level spacing is 
larger for low-lying states in a 3D potential well and also because the electron 
effective mass for semiconductors is typically smaller than for metals. 

The optical properties of a small metallic dot are typically dominated 
by its surface plasmon resonance. The polarizability of a sphere is, from 
(16.11), 


(CGS) 


XE 0 

1 + 4vxl3 



XEqEq 

l+tf/3* 


(56) 


where E 0 is the external electric field and x is the electronic susceptibility. 
This relation was presented in Chap. 16 for the static case, but it applies at 
high frequencies as long as the dot is small enough for retardation effects to be 
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Figure 20 (a) Energy level diagram for the states in a small spherical alkali metallic cluster. The 
numbers at right of the diagram show the number of electrons required to fill successive elec- 
tronic shells, (b) Abundance spectrum of Na clusters, showing high intensities for clusters with 
completely filled electronic shells. (After W. A. de Heer et al.) 


absent. Modeling the carriers in the dot as a lossless free electron gas, the sus- 
ceptibility is, from (14.6), 

(CGS) y(w) = —ne 2 /mo) 2 ; (SI) y(&>) = —ne 2 /im 0 wr . (57) 

Combining (56) and (57) gives 


(CGS) 

PM = 

E 0 

E 0 

l—4Tme 2 /3m(0 2 

l—(i) 2 /3(n 2 ’ 

(SI) 

P(o>) = 

s oEq 

© 

Co 

1 

l~ne 2 /3ms Q a) 2 

1— tWp/3 a) 2 


( 58 ) 
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where <o p is the plasma frequency of the bulk metal. The polarization diverges 
at a frequency: 

= u> p /V3 . (59) 

This is the surface plasmon resonance frequency for a sphere. It shifts the bulk 
plasma resonance for metals like Au and Ag from the UV into the visible por- 
tion of the spectrum. The result (59) is independent of particle size. In reality, 
however, the optical properties do depend somewhat on size due to retar- 
dation effects at larger radii and losses and intraband transitions at smaller 
radii. 

Liquids or glasses containing metallic nanoparticles are often brilliantly 
colored due to absorption by the surface plasmon resonance. They have been 
used for hundreds of years in stained glasses. Other optical applications of 
metallic nanoparticles make use of the large electric field just outside the 
nanoparticle near resonance. In techniques like surface enhanced Raman scat- 
tering (SERS) or second harmonic generation (SHG), weak optical processes 
in nanostructures near the surface of the nanoparticlc become measurable due 
to the locally high electric fields. 

Discrete Charge States 

If a quantum dot is relatively isolated electrically from its environment, it 
has a set of well-defined charge states, like an atom or molecule. Each succes- 
sive charge state corresponds to the addition of one more electron to the dot. 
Because of the coulomb repulsion between electrons, the energy difference 
between successive charge states can be very large. Within the Thomas-Fermi 
approximation (28), the electrochemical potential for adding the (N + l)th 
electron to a dot containing N electrons is given by: 

juw+i - •sy+i ~e<p = e N+1 + NU ~ aeV g , (60) 

where L7 is the coulomb interaction energy between any two electrons on the dot, 
often called the charging energy. The dimensionless number a is the rate at 
which a voltage V g applied to a nearby metal, typically referred to as the gate 
(see Fig. 21), shifts the electrostatic potential <p of the dot. 

In general, V will vary for different electronic states in the dot, but we as- 
sume here it is a constant, as in a classical metal. In this case, we can describe 
the electrostatics and interactions in terms of capacitances: 

U = e 2 /C and a = C g /C , (61) 

where C is the total electrostatic capacitance of the dot and C, is the capaci- 
tance between the dot and the gate. The quantity e/C is the electrostatic po- 
tential shift of the dot when one electron is added. 

If the dot is in weak electrical contact with a metallic reservoir, electrons 
will tunnel onto the dot until the electrochemical potential for adding another 
electron exceeds the electrochemical potential /x of the reservoir (Fig. 21). 
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Figure 21 (a) Schematic illustration of a quantum dot in tunnel contact with two metallic reser- 
voirs and capacitatively coupled to a gate. Main: Energy level diagrams illustrating the coulomb 
blockade. In (b) the gate voltage is such that the dot is stable with N electrons, so no current flows. 
In (c) the blockade is lifted when the electrochemical potential is lowered into the window be- 
tween the potentials in the leads, allowing successive charging and discharging of the dot and a net 
current flow. 


This sets the equilibrium occupancy N of the dot. The charge state can be 
changed using the gate voltage V g . The additional gate voltage AV g required to 
add one more electron from a reservoir of fixed /jl is, from (60), 

AV g = (1 /ae)(s N+1 - s N + e 2 /C) . (62) 

Adding an extra electron to the dot requires enough energy to fill up the next 
single-particle state and also enough energy to overcome the charging energy. 

The charging energy U depends on both the size of the dot and the local 
electrostatic environment. Nearby metals or dielectrics will screen the coulomb 
interaction and reduce the charging energy. In general, U must be calculated for 
the specific geometry. As a simple model, consider a spherical dot of radius R 
surrounded by a spherical metal shell of radius R + d. This shell screens the 
coulomb interaction between electrons on the dot. An elementary application of 
Gauss’s law (Problem 5) gives the capacitance and therefore the charging energy: 


(CGS) 


U = 


e 2 d 
sRR + d ; 



= c 2 d 
Aire^R R + d 


(63) 


For R = 2 nm, d = 1 nm, and e = 1, the charging energy is e 2 /C = 0.24 eV. 
This far exceeds k B T *= 0.026 eV at room temperature, indicating that thermal 
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fluctuations in the charge of the dot will be strongly suppressed. It is compara- 
ble to the energy level spacing (0.76 eV) between the lowest two states in a 2-nm- 
radius CdSe dot. In contrast, it is much larger than the level spacing (2 meV) for 
a 2-nm-radius metallic dot. The addition energy of a metallic dot is therefore 
dominated by the charging energy, but in a semiconductor dot the charging 
energy and the level spacing are of comparable importance. 

Charging effects are destroyed if the tunneling rate between the dot and 
the electrodes is too rapid. The charge resides on the dot for a time scale of 
order St = RC, where R is the resistance for tunneling to the electrodes. By 
the uncertainty principle, the energy level will be broadened by 

8e « h/St = h/RC = (e 2 /C)(h/e 2 )/R . (64) 

The uncertainty in the energy of the electron becomes comparable to the 
charging energy when R ~ hie 2 . For resistances below this value, quantum fluc- 
tuations due to the uncertainty principle smear out the coulomb charging 
effects. The conditions for well-defined charge states ol a quantum dot are then 

R>h/e 2 and e 2 /C>k B T . (65) 

ELECTRICAL TRANSPORT IN OD 
Coulomb Oscillations 

At temperatures T < (U + ks)/k B , the charging energy U and the level 
spacing control the flow of electrons through a quantum dot, as shown in 
Fig. 21. Transport through the dot is suppressed where the Fermi levels of the 
leads lie between the electrochemical potential for the N and N + 1 charge 
states (Fig. 21b). This is called the Coulomb blockade. Current can only flow 
when fi e (N + 1) is lowered to lie between the Fermi levels of the left and right 
leads. Then an electron can hop on the dot from the left electrode and off the 
dot to the right electrode, resulting in current flow (Fig. 21c). This process re- 
peats with increasing V g for each new charge state. This leads to so-called 
Coulomb oscillations in the conductance as a function of V g , shown in Fig. 22. 
If U §> k R T, these peaks can be very sharp. The spacing between the Coulomb 
peaks is determined by (62). 

Coulomb oscillations are first and foremost a result of charge quantiza- 
tion. They will occur if U > k R T even if the single particle level spacing is very 
small, As < k B T. This is often the case in metallic quantum dots. A device 
showing Coulomb oscillations is called a single electron transistor (SET), 
since it turns on and off periodically as the occupancy of the dot is changed 
by e. This effect is quite remarkable, and can be used as an ultrasensitive 
electrometer. It detects electric fields much as a SQUID (Chap. 10) detects 
magnetic fields. One is based on the quantization of charge, the other on the 
quantization of flux. 
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Figure 22 Conductance oscillations versus gate voltage V g measured in a quantum dot formed in 
a gated GaAs/AlGaAs beterostructure at i = 0.1 X. The data are plotted on a log scale. As the gate 
voltage increases, the barriers become more transparent and the peaks get broader. The lineshape 
of the peak in (b) is determined by thermal broadening alone, while the one in (c) also reflects the 
intrinsic Breit-Wigner lineshape. (Adapted from Foxman et al.) 


SETs can also he used to make single electron turnstiles and pumps. Oscil- 
lating voltages at a frequency/ applied to the gates of a properly designed 
quantum dot system can shuttle a single electron through dot per cycle of the 
oscillation. This results in quantized current flowing through the dot: 

1 = ef . (66) 

Such devices are under investigation as current standards in metrology. 

For the quantum dot in Fig. 22, the level spacing A/? > k B T. The A 7 th 
Coulomb oscillation then corresponds to resonant tunneling through a single 
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Figure 23 (a) Schematic of a 2D circular quantum dot formed in a GaAs/AlGaAs heterostruc- 
ture. (b) The differential conductance dl/dV as a function of both gate voltage and source drain 
bias, plotted as a gray scale. The white diamond regions correspond to different charge states of 
the dot. A larger charging energy is observed for N = 2 and 6 electrons on the dot, corresponding 
to filled electronic shells. The additional lines on the diagram correspond to excited energy levels 
of the dot. (Courtesy of L. Kouwenhoven. ) 


quantized energy level e N . The coulomb oscillations in this case are analogous 
to the theoretical resonant tunneling peaks described by (29) and shown in 
Fig. 15. A crucial difference from (29) is that the positions of the coulomb 
peaks are determined by both the level spacing and the coulomb charging 
energy (62). The lower right panel of Fig. 22 shows a fit of one coulomb peak 
to the Breit-Wigner form for resonant tunneling (33). 

The I-V characteristics of a quantum dot are in general complex, reflect- 
ing the interplay of the charging energy, the excited state level spacing, and 
the source-drain bias voltage. In Fig. 23, measurements of the first few elec- 
trons added to a small, 2D circular dot are shown. The differential conduc- 
tance dl/dV is represented using a gray scale as both the gate voltage and the 




554 


source drain bias are varied. Each of the lines seen corresponds to tunneling 
through an individual quantum state of the dot. The white diamonds along the 
V„ axis, indicating dl/dV — 0, correspond to the Coulomb blockade. Each 
successive diamond corresponds to another electron on the dot. The point at 
which different diamonds touch along the V, axis are the Coulomb oscillations 
where the charge state of the dot changes. The height of the diamond corre- 
sponds to eV m;tx — e 2 IC + A*-, the maximum voltage that can be applied with- 
out current flowing in a given charge state. The diamonds corresponding to 
N = 2 and N — 6 are noticeably larger than neighboring diamonds, indicating 
a larger addition energy for adding the third and seventh electron to the dot. 

This dot can be effectively modeled by a 2D harmonic oscillator confining 
potential U(x, y) = ' 2 mor(x 2 + y 2 ), with energy levels: 

= (i +j + 1 )hai , (67) 

where i and / are nonnegative integers. This level spectrum can be used in 
conjunction with (62) to find the addition energies that determine the sizes of 
the diamonds in the figure. The first electron fills the spin-degenerate ground 
state energy level s 00 . The second electron fills the same quantum state, but 
with opposite spin, at a gate voltage AV g2 = U/ae after the first electron is 
added. The third electron fills one of the degenerate states s, n or £ Ui after a 
AV g3 = (U + ha>)/ae. The next three electrons fill the rest of these states, each 
spaced in gate voltage by U/ae. The seventh electron fills one of the degener- 
ate states s n , s 20 , or e 02 , after a gate voltage AV„ 7 = (U + tua)lae. This simple 
model correctly predicts the larger addition energies for the 3rd and 7th 
electrons seen in the experiment. The addition energy is larger by the level 
spacing when the extra electron is added to a new energy' level above a filled 
electronic shell (N = 2 and N — 6). 


Spin , Mott Insulators, and the Kondo Effect 

Consider a quantum dot that is occupied by an odd number of electrons in 
the blockaded region, as shown in Fig. 24. The highest single particle energy 
level of the dot is doubly degenerate and an electron can either reside in a 
spin-up or spin-down state. The addition of a second electron of opposite spin 
is allowed by the Pauli exclusion principle but is energetically prohibited by 
the coulomb interaction between the electrons. This is analogous to the Mott 
Insulator where a half-filled band is insulating because coulomb interactions 
prohibit double -occupancy of the lattice sites by electrons. 

The dot therefore has a spin-l^ magnetic moment with two degenerate 
configurations, spin up and spin down, in the absence of coupling to the leads. 
If coupling to the leads is included, however, this degeneracy is lifted at low 
temperatures. The ground state is a quantum superposition of the two 
spin configurations, with transitions between them accomplished by a virtual 
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(a) (b) 




(c) (d) 

Figure 24 The Kondo effect in a quantum dot. For an unpaired spin on the dot, a virtual process 
(b) can occur that converts the spin up (a) to the spin down (c) state and transfers an electron from 
one side of the dot to the other. The ground state of the system is a coherent superposition on the 
initial and final states shown, creating a spin singlet between the spin on the dot and the spins in 
the leads. This is called the Kondo effect, and produces a narrow peak of width ~k B T K in the den- 
sity of states at s F in addition to the original broadened level of width T, as shown in (d). 


intermediate state involving an exchange of electrons with the leads, as is illus- 
trated in Fig. 24. This is known as the Kondo effect. The local moment pairs 
with electrons in the metallic electrodes to create a spin singlet. This occurs 
below a temperature known as the Kondo temperature T K : 

Tk = l(rur exp[7T* 0 (* 0 + UWU)] (68) 

where T is the level width defined in (33) and e 0 is indicated in Fig. 24 
( e 0 < 0). A peak in the density of states of the dot of width k B T K appears at the 
Fermi energy due to the admixture of states in the electrodes at an energy e F . 
The Kondo temperature is very small unless the coupling T to the leads is 
large, since the process involves a virtual intermediate state. 
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Because the Kondo effect involves exchange of electrons with the leads, it 
causes transmission through the dot even in the blockaded region, as illus- 
trated in Fig. 24a-c. For symmetric harriers and T <3 T K , the transmission 
coefficient through the Kondo resonance can be unity, just as for resonant tun- 
neling. This effect has been seen in transport through quantum dots and in 
STM measurements of magnetic impurities on metal surfaces. The Kondo ef- 
fect was first observed in metals containing magnetic impurities. The forma- 
tion of a spin singlet between the magnetic impurities and the conduction 
electrons enhances the scattering of the electrons. This will be discussed fur- 
ther in Chap. 22. 

Cooper Pairing in Superconducting Dots 

In a small metallic dot made of a superconductor, there is an interest- 
ing competition between single electron charging and the Cooper pairing 
of electrons. With an odd number of electrons residing on the dot, there is 
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Figure 25 Measurement of coulomb oscillations in a superconducting metallic dot with decreas- 
ing temperature. A crossover from e-periodic oscillations to 2e-periodic oscillations is seen as the 
temperature is lowered due to the Cooper pairing of electrons on the dot. (After M. Tinkhain, 
J. M. Hergenrother, and J. G. Lu.) 
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necessarily an unpaired electron. If the Cooper pair binding energy 2A is 
larger than the charging energy U, it is energetically favorable to add an elec- 
tron to the dot, paying the energy U in order to gain the pairing energy 2A. 
The odd-charge states are thus energetically unfavorable. Electrons will be 
added to the dot in Cooper pairs, and the Coulomb oscillations will be 2e- 
periodic. This is shown in Fig. 25. This is a remarkable manifestation of Cooper 
pairing. 


VIBRATIONAL AND THERMAL PROPERTIES 

To treat the vibrational properties of nanostructures, we will begin from a 
continuum description of the elastic properties. This is analogous to employ- 
ing the band structure as the starting point to describe the electronic proper- 
ties. It is a good approximation for all but the smallest of nanostructures. 

In general, the components of stress and strain in a solid are related by a 
matrix. A stress along one axis will produce a strain along that axis, but it will 
also produce strains along other axes. For example, a cube stretched along one 
axis will typically contract somewhat along the orthogonal axes. To simplify the 
discussion below, we will ignore the off-diagonal elements and treat the stress- 
strain matrix as diagonal and isotropic. In other words, strains will only occur 
along the direction of the stress and the magnitude will be independent of the 
axis direction. For a more complete treatment, we refer the reader to advanced 
texts on mechanics. 

Quantized Vibrational Modes 

Just as the electronic degrees of freedom are quantized, the vibrational 
frequencies become discrete in a ID or OD solid. The continuous low- 
frequency modes associated with the acoustic modes, a> — v\K, become instead 
a series of discrete frequencies a>j. The exact frequencies and wavevectors 
depend in detail on the shape and boundary conditions of the solid. 

An illustrative example is die vibrations around the circumference of a 
thin cylinder of radius R and thickness h 71. as shown in Fig. 26. In Fig. 26a, 





Figure 26 Fundamental vibrational modes of a thin-walled cylinder. Image (a) is a longitudinal 
compressional mode, (b) is the radial breathing mode (RBM), and (c) is a transverse mode. 
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a quantized longitudinal acoustic mode is schematically illustrated. The al- 
lowed frequencies can be found by applying periodic boundary conditions 
around the circumference of the cylinder: 

Kj = j/R ; <o Lj ~ ( v L /R)j ; j = 1, 2 ... . (69) 

Another mode, called the radial breathing mode (RBM), is shown in 
Fig. 26b. The radius of the cylinder uniformly expands and contracts, producing 
circumferential tension and compression. From elasticity theory, the elastic 
energy associated with a strain e for an isotropic medium is given by 

U tot = l[re 2 dV , (70) 

Jv 

where Y is the elastic (Young’s) modulus. The strain in the cylinder for a radius 
change dr is e = dr/R, yielding 


, (71) 

where V is the volume of the cylinder. Equation (71) has the form of a Hooke’s 
law spring energy where the spring constant is given by YV/R 2 . The vibrational 
frequency is then 

wrbm - (YV/MRY 2 = (Y/p) yz /R = v l /R , (72) 

where in the last step we have defined a longitudinal sound velocity v L — VY/p. 

The final class of quantized modes around the circumference, correspond- 
ing to transverse acoustic modes, are shown in Fig. 26c. Their wavevectors and 
frequencies are given by 

v L hf 

t-cs i™> 

Note the frequency scales like Kj. The origin of this behavior will be discussed 

further below. 

Quantized vibrational modes can be measured in a variety' of ways. One 
technique widely used to probe the vibrational structure of individual 
nanoscale objects is Raman spectroscopy (see Chap. 15). Raman spectroscopy 
of single nanotubes is shown in Fig. 27. For a nanotube, v L = 21 km/s, and the 
energy of the radial breathing mode is, from (72), 

Aw HhM = 14 meV/R[nm] . 

The measured values are in good agreement with this expression. As a 
result, measurements of the RBM can be used as a diagnostic to infer the 
radius of a nanotube. 
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Figure 27 Raman spectra of individual carbon 
nanotubes. The radial breathing mode frequencies 
are labeled in the main panel, along with the struc- 
tural assignments ( n , m ). Note: 160 cm -1 = 20 meV. 
(After A. Jorio et al.) 


Transverse Vibrations 

We now address the phonons propagating in the direction of the axis of a 
long, thin object, such as the cylinder discussed in the previous section or a 
thin, solid beam (Fig. 28). The longitudinal phonons are similar to the 3D 
case, with a dispersion co — v L K, where K is the continuous wavevector. How- 
ever, there is a fundamental modification of the transverse phonons at wave- 
lengths longer than the thickness h of the beam. Instead of shearing, as in a 
bulk transverse phonon, the solid bends, as shown in Fig. 28a. This is the 
classical problem of transverse flexural waves on a beam. The energy of bend- 
ing comes from the longitudinal compression/stretching of the solid along the 
inner/outer arcs of the bend. The linear wavevector dependence oj t = v T K 
characteristic of a bulk solid is changed to dispersion quadratic in K, as we 
show below. 

Consider a transverse standing wave on a solid rectangular beam of 
thickness h, width w, and length L whose displacement is given by y(z,t) = y 0 
cos (Kz — u>t). The strain at a given point inside the beam is given by the local 
curvature and the distance t from the center line of the beam (Fig. 28a): 

e = ~(d 2 y/dz 2 )t = K 2 yt . (74) 

The total energy associated with this strain is again given by (70): 

U tot = (wY/2)J jT {Khytfdt dz = YVK 4 h 2 (y 2 )/24 , (75) 

where (y 2 ) is averaged over one period of the oscillation. This again gives an 
effective spring constant and, from steps analogous to (70-72), an oscillation 
frequency: 


w T = vJiKWw, . 


(76) 



560 




500 

400 

300 

'n' 

X 

s 

'-•> 200 

100 

0 


L (yum) 

(b) 


Figure 28 (a) Stresses in a bent beam, showing the inner portion under compression while the 
outer portion is under tension, (b) SEM micrograph of a series of suspended Si beams of varying 
lengths L and the measured resonance frequency as a function of L. The line is a fit to the func- 
tional form/ = B/L 2 , where B is a constant. (After D. W. Carr et al.) 


Note that the frequency depends on the longitudinal sound velocity and not 
the transverse sound velocity, since the mode is now essentially compressional 
in nature. It is no longer linear in K since the effective restoring force grows 
stronger with increasing curvature, i.e., increasing K. In contrast, the torsional 
mode, corresponding to a twist of the beam along its length, retains its shear 
character and, co^t °c K. 

Transverse vibrational modes described by (76) are frequently observed in 
microscale and nanoscale beams. A set of nanoscale beams constructed in 
Si using electron beam lithography and etching is shown in Fig. 28b. The 
frequencies associated with fundamental resonance K { = 2tt/L of these beams 
scale as 1/L 2 (Fig. 28c), as expected from (76). 

Note that the modification of the dispersion relation (76) for long wave- 
length transverse modes is not restricted to nanoscale systems. The only re- 
quirement is that the system be in the geometry of a thin beam or slab with 
transverse dimension h smaller than the wavelength, i.e., Kh < 1. For exam- 
ple, AFM cantilevers operated in the noncontact mode, as discussed above, 
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are well described by this relation. The dispersion relation (76) is also related 
to the a) ~ K 2 dependence seen in (73) far the class of modes shown in Fig. 
26c. Both describe the transverse flexural vibrations, one of a beam and the 
other of a thin shell. 

A revolution is underway in the fabrication of small, complex mechanical 
structures using techniques adapted from microelectronic processing. The 
beams shown in Fig. 28b are a simple example. These structures can be inte- 
grated with electronic devices, creating microelectromechanical systems 
(MEMs) and nanoelectromechanical systems (NEMs). They are being ex- 
plored for a variety of applications, including sensing, data storage, and signal 
processing. 

Heat Capacity and Thermal Transport 

The above relations indicate that the quantized vibrational mode energies 
are typically less than k B T at room temperature except in the very smallest 
structures. Modes along the confined directions will thus be thermally excited 
at room temperature. As a result, the lattice thermal properties of nanostruc- 
tures will be similar to their bulk counterparts. In particular, the lattice heat 
capacity and thermal conductivity will be proportional to T'\ as for a 3D solid 
(Chap. 5). 

At low temperatures, however, vibrational excitations of frequency co in 
the confined directions will freeze out when T < hu)/k B . In the case of a long, 
thin structure, the system will behave as ID thermal system at sufficiently low 
temperatures, with a set of ID phonon subbands analogous to the ID elec- 
tronic subbands shown in Fig. 9. A calculation analogous to the one performed 
in Chap. 5 for a 3D solid yields the heat capacity per ID acoustic phonon sub- 
band with a dispersion io — vk (Prob. 6): 

CV 1D) = ZrfLklT/Shv . (77) 

The thermal conductance G th of the wire is defined as the ratio of the net en- 
ergy flow through the wire divided by the temperature difference AT between 
its ends. The thermal conductance per ID phonon subband is 

G$, d) = (77^17/3/1)3 . (78) 

This result is derived in Prob. 6 using an approach analogous to the one em- 
ployed to derive the Landauer formula for conductance of a ID channel. Note 
3 is now the transmission probability for phonons through the structure. For a 
transverse flexural mode, where a> oc K 2 . the result (77) is modified, but (78) is 
the same. 

Both (77) and (78) are linear in temperature, in contrast to the T 3 result 
for 3D. The difference reflects the number of modes with energies ha> < k B T, 
or equivalently, with wavevectors K < k s T/hv. The number of modes scales 
like K d , where D is the dimensionality, producing T 3 in 3D and T in ID. 



Note that in the case of perfectly transmitted phonons through the chan- 
nel, the thermal conductance (78) is determined only by fundamental con- 
stants and the absolute temperature. This result is analogous to the quantized 
electronic conductance (21) of a ID channel, which was independent of the 
electron velocity in the channel. Both the ballistic thermal conductance (78) 
and the ID form of the heat capacity' (77) have been observed in experiments 
on narrow wires at very low temperatures. 

SUMMARY 

• Real space probes can give atomic-scale images of nanostructures. 

• The density of states of a I D subband, D(E) — 4L/hv, diverges at the 
subband thresholds. These are called van ffove singularities. 

• The electrical conductance of a ID system is given by the Landauer formula, 
G = (2e 2 //i)3, where 3 is the transmission coefficient through the sample. 

• The conductance of a quasi-ID system can be strongly influenced by quan- 
tum interference among the electron paths traversing the sample, leading to 
resonant tunneling, localization, and the Aharonov-Bohm effect. 

• The optical properties of a quantum dot can be tuned by changing its size 
and hence its quantized energy levels. 

• Adding an extra charge e to a quantum dot requires an additional electro- 
chemical potential given by U + As, where U is the charging energy and As 
is the level spacing. 

• The vibrational modes of a nanometer-scale object are quantized. 


Problems 

1. Carbon nanotube band structure. Figure 29 shows the graphene lattice with the 
primitive lattice translation vectors of length a = 0.246 nm, along with the first 
Brillouin zone, (a) Find the set of reciprocal lattice vectors G associated with the 
lattice, (b) Find the length of the vectors K and K' shown in Fig. 29 in terms of a. 

For energies near the Fermi energy and wavevectors near the K point, the 2D 
band structure can be approximated as 

e = ±7b f |Ak| Ak = k — K , 

where v F = 8 X 10 s m/s. A similar approximation holds near the K' point. Consider 
a tube rolled up along the x-axis with a circumference na. By applying periodic 
boundary conditions along the rolled up direction, the dispersions of the ID sub- 
bands near the K. point can be found, (c) Show that, if n is divisible by 3, there exists 
a “massless” subband whose energy is linear in A k r Sketch this subband. These, 
nanotubes are ID metals, (d) If n is not divisible by 3, the subband structure is that 
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Figure 29 (a) The graphene lattice and 
(b) the first Brillouin zone of the 
graphene lattice showing the conical dis- 
persion of the energies near the K and K' 
points. 


as shown in Fig. 10. For the case of n = 10, find the magnitude of the semiconduct- 
ing bandgap e u in eV and show that e^/e n = 2. (e) Again for the n = 10 case, show 
that the dispersion relation of the lowest electron subband is of the form of a rela- 
tivistic particle, e 1 — (m*c 2 ) 2 + (pc) 2 , where v F plays the role of the speed of light, 
and find the ratio of effective mass m* to the free electron mass m. 

2. Filling subbands. For electrons in a square GaAs wire of width 20 nm, find the 
linear electron density at which the n x = 2, n y = 2 subband is first populated in 
equilibrium at T = 0. Assume an infinite confining potential at the wire boundary. 

3. Breit-Wigner form of a transmission resonance. The purpose of this problem is 
to derive (33) from (29). (a) By expanding the cosine for small phase differences away 
from resonance, 8<p = ip" — 2 mi, find a simplified form of (29) involving only I t x \ 2 , lf 2 l 2 , 
and 8tp. (b) Show that, for states in a ID box, the following relation holds between 
small changes in the phases and small changes in the energy: 8e/ Ae = Sip/ 2 tt, where 
Ac is the level spacing, (c) Combine (a) and (b) to obtain (33). 

4. Barriers in series and Ohm's Law. (a) Derive (36) from (35). (b) Show that the 
ID Drude conductivity cr w = n lD e 2 r/m can be written as cr w = (2e 2 /h)( B . (Note: 
The momentum relaxation rate and the backscattering rates are related as 1/t = 2/t b 
because the former corresponds to the relaxation from p to 0 while the latter corre- 
sponds to relaxation from p to — p.) 


5. Energies of a spherical quantum dot. (a) Derive the formula (63) for the charg- 
ing energy, (b) Show that, for d < R, the result is the same as that obtained using 
the parallel plate capacitor result, C = ee„A/d. (c) For the case of an isolated dot, 
rf — > oo , find the ratio of the charging energy to lowest quantized energy level. Express 
your answer in terms of the radius R of the dot and the effective Bohr radius a B . 

6. Thermal properties in ID. (a) Derive the formula (77) for the low temperature heat 
capacity of a single ID phonon mode within the Debye approximation, (b) Derive the 
relation for the thermal conductance (78) of a ID phonon mode between two reser- 
voirs by calculating the energy flow out of one reservoir at a temperature T, and 
subtracting the energy flow from the other reservoir at a temperature T 2 . Use an ap- 
proach analogous to that used to obtain (20) and (24) for the electrical conductance. 
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CHAPTER 19: NONCRYSTALLINE SOLIDS 


The terms amorphous solid, noncrystalline solid, disordered solid, glass, 
or liquid have no precise structural meaning beyond the description that the 
structure is “not crystalline on any significant scale.” The principal structural 
order present is imposed by the approximately constant separation of nearest- 
neighbor atoms or molecules. We exclude from the present discussion disor- 
dered crystalline alloys (Chapter 22) where different atoms randomly occupy 
the sites of a regular crystal lattice. 


DIFFRACTION PATTERN 

The x-ray or neutron diffraction pattern of an amorphous material such as 
a liquid or a glass consists of one or more broad diffuse rings, when viewed on 
the plane normal to the incident x-ray beam. The pattern is different from the 
diffraction pattern of powdered crystalline material which shows a large num- 
ber of fairly sharp rings as in Fig. 2.17 of Chap. 2. The result tells us that a liq- 
uid does not have a unit of structure that repeats itself identically at periodic 
intervals in three dimensions. 

In a simple monatomic liquid the positions of the atoms show only a short 
range structure referred to an origin on any one atom. We never find the cen- 
ter of another atom closer than a distance equal to the atomic diameter, but at 
roughly this distance we expect to find about the number of nearest-neighbor 
atoms that we find in a crystalline form of the material. 

Although the x-ray pattern of a typical amorphous material is distinctly 
different from that of a typical crystalline material, there is no sharp division 
between them. For crystalline powder samples of smaller and smaller particle 
size, the powder pattern lines broaden continuously, and for small enough 
crystalline particles the pattern becomes similar to the amorphous pattern of a 
liquid or a glass. 

From a typical liquid or glass diffraction pattern, containing three or four 
diffuse rings, the only quantity which can be determined directly is the radial 
distribution function. This is obtained from a Fourier analysis of the experi- 
mental x-ray scattering curve, and gives directly the average number of atoms 
to be found at any distance from a given atom. The method of Fourier analysis 
is equally applicable to a liquid, a glass, or a powdered crystalline material. 

It is convenient to begin the analysis of the diffraction pattern with Eq. 
(2.43). Instead of writing it for the structure factor of the basis, we write the 
sum for all the atoms in the specimen. Further, instead of specializing the scat- 
tering to the reciprocal lattice vectors G characteristic of a crystal, we consider 
arbitrary scattering vectors Ak = k' — k, as in Fig. 2.6. We do this because 
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scattering from amorphous materials is not limited to the reciprocal lattice 
vectors, which in any event cannot here be defined. 

Therefore the scattered amplitude from an amorphous material is de- 
scribed by 


S(Ak) = 2/™ exp(— zAk • rj , (1) 

m 

with f m the atomic form factor of the atom, as in Eq. (2.50). The sum runs over 
all atoms in the specimen. 

The scattered intensity at scattering vector Ak is given by 

f(Ak) = S*S -=■ y. 2/m/„ exp[iAk • (r,„ - rj] , (2) 

m n 

in units referred to the scattering from a single electron. If a denotes the angle 
between Ak and r m — r„, then 


I(K, a) = 2 'Lfnfn exp (iKr mn cos a) , (3) 

m n 


where K is the magnitude of Ak and r mn is the magnitude of r„, — r n . 

In an amorphous specimen the vector r m — r n may take on all orientations, 
so we average the phase factor over a sphere: 


1 f 1 

(expiiKr cos a)) = — 2n J d ( cos a) exp (iKr mn cos a) 


sin Kr m 
Kr m „ 


(4) 


Thus we have the Debye result for the scattered intensity at Ak: 


1(K) = 2 E (fmfn sin Kr mn )/Kr mn . (5) 

m 


Monatomic Amorphous Materials 

For atoms of only one type, we let/,,, = /„ = / and separate out from the 
summation (5) the terms with n — m. For a specimen of N atoms, 


l(K) = Nf 1 


+ 2' (sin K r nJ/Kr mn 


( 6 ) 


The sum runs over all atoms m except the origin atom m = n. 

If p(r) is the concentration of atoms at distance r from a reference atom, 
we can write (6) as 

I(K) — A/f l + J dr 4'7rr 2 p(r)(sin Kr)/Kr , 


(7) 
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where R is the (very large) radius of the specimen. Let p 0 denote the average 
concentration; then (7) may be written as 

I(K) = Nf 2 ! I 4- J dr 47rr 2 [p(r) — p 0 ](sin Kr)/Kr + ip</K) J dr 47 rr sin Kr 


( 8 ) 


The second integral in (8) gives the scattering from a uniform concentration 
and may be neglected except in the forward region of very small angles; it re- 
duces to a delta function at K = 0 as R — > . 


Radial Distribution Function 

It is convenient to introduce the liquid structure factor defined by 

S(K) = I/Nf . (9) 

Note that this is not at all the same as S(Ak) in (1). From (8) we have, after 
dropping the delta function contribution, 

S(K) — 1 + f dr 47rr 2 |p(r) — p 0 ](sin Kr)/Kr . (10) 

Jo 

We define the radial distribution function g(r) such that 

p(r) = g(r)p 0 . (11) 


Then (10) becomes 

S(K) = 1 + 47rp D j dr[g(r) — l]r 2 (sin Kr)/Kr 
Jo 

= 1 + Po J dr [g(r) - 1] exp(iK • r) , 


( 12 ) 


because (sin Kr)/Kr is the spherically symmetric or s term in the expansion of 
exp(tK • r). 

By the Fourier integral theorem in three dimensions, 


g(r) ~ 1 


\ dK [S(JQ - 1] exp(— iK 
8irpo J 


r) 


27T £ p„r 


J dK [, S(K ) - 1 ]K sin Kr 


(13) 


This result allows us to calculate the radial distribution function g(r) (also called 
the two-atom correlation function) from the measured structure factor S(K). 

One of the simplest liquids well suited to x-ray diffraction study is liquid 
sodium. The plot of the radial distribution 47rr 2 p(r) vs. r is given in Fig. 1, to- 
gether with the distribution of neighbors in crystalline sodium. 
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Figure 1 (a) Radial distribution curve 4irr 2 p(r) for liquid sodium, (b) Average density curve 
47rr 2 p 0 . (c) Distribution of neighbors in crystalline sodium. (After Tarasov and Warren.) 


Structure of Vitreous Silica , Si0 2 

Vitreous silica (fused quartz) is a simple glass. The x-ray scattering curve is 
given in Fig. 2. The radial distribution curve 4'irr 2 p(r) vs. r is given in Fig. 3. 
Because there are two kinds of atoms, p(r) is actually the superposition of two 
electron concentration curves, one about a silicon atom as origin and one 
about an oxygen atom as origin. 

The first peak is at 1.62 A, close to the average Si-O distance found in 
crystalline silicates. The x-ray workers conclude from the intensity of the first 
peak that each silicon atom is tetrahedrally surrounded by four oxygen atoms. 
The relative proportions of Si and O tell us that each O atom is bonded to two 
Si atoms. From the geometry of a tetrahedron, the 0-0 distance should be 
2.65 A, compatible with the distance suggested by the shoulder in Fig. 3. 

The x-ray results are consistent with the standard model of an oxide glass, 
due to Zachariasen. Figure 4 illustrates in two dimensions the irregular struc- 
ture of a glass and the regularly repeating structure of a crystal of identical 
chemical composition. The x-ray results are completely explained by picturing 
glassy silica as a random network in which each silicon is tetrahedrally sur- 
rounded by four oxygens, each oxygen bonded to two silicons, the two bonds 
to an oxygen being roughly diametrically opposite. The orientation of one 
tetrahedral group with respect to a neighboring group about the connecting 
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Figure 2 Scattered x-ray intensity vs. scattering angle 0, for vitreous Si0 2 . (After B. E. Warren.) 


Si-O-Si bond can be practically random. There is a definite structural scheme 
involved: each atom has a definite number of nearest neighbors at a definite 
distance, but no unit of structure repeats itself identically at regular intervals 
in three dimensions, and hence the material is not crystalline. 

It is not possible to explain the x-ray results by assuming that vitreous 
silica consists of very small crystals of some crystalline form of quartz, such as 
cristoballite. Small angle x-ray scattering is not observed, but would be ex- 
pected from discrete particles with breaks and voids between them. The 
scheme of bonding in glass must be essentially continuous, at least for the 
major part of the material, although the scheme of coordination about each 
atom is the same in vitreous silica and in crystalline cristoballite. The low ther- 
mal conductivity of glasses at room temperature, as discussed below, also is 
consistent with the continuous random network model. 

A comparison of experimental and calculated x-ray intensity results for 
amorphous germanium is shown in Fig. 5. The calculations are for a random 
network model and for a microcrystallite model. The latter model gives a very 
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Figure 3 Radial distribution curve for vitreous Si0 2 , as the Fourier transform of Fig. 2. The posi- 
tions of the peaks give the distances of atoms from a silicon or an oxygen. From the areas under 
the peaks it is possible to calculate the number of neighbors at that distance. The vertical lines in- 
dicate the first few average interatomic distances; the heights of the lines are proportional to the 
peak areas. (After B. E. Warren.) 



(a) (b) 


Figure 4 Schematic two-dimensional analogs illustrating the differences between: (a) the regularly 
repeating structure of a crystal and (b) continuous random network of a glass. (After Zachariasen.) 
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Figure 5 Comparison of experimental (dashed curve) and calculated (solid curve) reduced inten- 
sity function for amorphous germanium, (a) Amorphous germanium compared with microcrystal- 
lite model, (b) Amorphous germanium compared with random network model. (Results by 
]. GraczykandP. Chaudhari.) 


poor agreement. The random network model is supported for amorphous sili- 
con by studies of the band gap and spectroscopic work on the 2 p shell. 


GLASSES 

A glass has the random structure of the liquid from which it is derived by 
cooling below the freezing point, without crystallization. Also, a glass has the 
elastic properties of an isotropic solid. 

By general agreement, we say that a liquid on being cooled becomes a 
glass when the viscosity equals 10 13 poise, where a poise is the CGS unit of 
viscosity. 1 This defines the glass transition temperature T g . At temperatures 
above T e we have a liquid; below T g we have a glass. The transition is not a 
thermodynamic phase transition, only a transition for “practical purposes.” 


'The SI unit of viscosity is 1 Nsm 2 , so that 1 poise = 0.1 Nsm 2 . It is quite common to find 
viscosities given in cp or centipoise, being 10~ 2 poise. 
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The value 10 13 poise used to define T g is arbitrary, but not unreasonable. If we 
bond a slab of glass 1 cm thick to two plane parallel vertical surfaces, the glass 
will flow perceptibly in one year under its own weight when the viscosity drops 
below 10 13 poise. (For comparison, the viscosity of the mantle of the earth is of 
the order of 10 22 poise.) 

Relatively few liquids can be cooled fast enough in the bulk to form a glass 
before crystallization intervenes. Molecules of most substances have high 
enough mobility in the liquid so that on cooling a liquid-solid melting transi- 
tion occurs a long time before the viscosity increases to 10 13 poise or 10 15 cp. 
Liquid water has a viscosity 1.8 cp at the freezing point; the viscosity increases 
enormously on freezing. 

We can often make a glass by depositing a jet of atoms of a substrate 
cooled to a low temperature, a process which will sometimes produce an 
amorphous layer with glasslike properties. Amorphous ribbons of some metal 
alloys may be produced in this way in industrial quantities. 

Viscosity and the Hopping Rate 

The viscosity of a liquid is related to the rate at which molecules undergo 
thermal rearrangement on a local scale, as by hopping into a vacant neighbor 
site or by interchange of two neighbor molecules. The physics of the transport 
process is somewhat different from that of viscosity' in the gas phase, but the 
gas phase result gives a qualitative lower limit to the viscosity of the liquid 
phase, a limit that applies to nearest-neighbor hopping of atoms. 

The gas result (TP 14.34) is 

y = 5 pc(. , (14) 

w'here rj is the viscosity, p the density, c the mean thermal velocity, and ( the 
mean free path. In the liquid l is of the order of magnitude of the intermole- 
cular separation a. With “typical” values p «= 2 g cm -3 ; c ~ 10 5 cm s ; 
a «= 5 X 10~ 8 cm, we have 

Tj(min) «= 0.3 X 1CT 2 poise = 0.3 cp (15) 

as an estimate of the lower limit of the viscosity of a liquid. (Tables in chemical 
handbooks only rarelv list values below this.) 

We give now a very simple model of the viscosity of a liquid. In order to 
hop successfully, a molecule must surmount the potential energy barrier pre- 
sented by its neighbors in the liquid. The preceding estimate of the minimum 
viscosity applies when this barrier may be neglected. If the barrier is of height 
E, the molecule will have sufficient thermal energy' to pass over the barrier 
only a fraction 


f~ exp (-E/fcjT) 


(16) 
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of" the time. Here E is an appropriate free energy and is called the activation 
energy for the process that determines the rate of hopping. It is related to the 
activation energy for self-diffusion. 

The viscosity will be increased as the probability of successful hopping is 
decreased. Thus 


T] ~ r)(min)/f ~ 7j(min) exp(E/k B T) . (17) 

If tj — 10 13 poise at the glass transition, the order of magnitude of/ must be 

/ = 0.3 X KT 15 (18) 

at the transition, using (15). The corresponding activation energy is 

E/k B T g = —In/ = ln(3 X 10 15 ) = 35.6 . (19) 

If T s - 2000 K, then k B T R = 2.7 X 10“ 13 erg and E = 9.6 X 1(T 12 erg = 6 eV. 
This is a high potential energy barrier. 

Glasses with lower values of T g will have correspondingly lower values of 
E. (Activation energies obtained in this way are often labeled as £ v isc .) Materi- 
als that are glass-formers are characterized by activation energies of the order 
of 1 eV or more; non-glass-formers may have activation energies of the order 
of 0.01 cV. 

When being pressed into molds or drawn into tubes, glass is used in a 
range of temperatures at which its viscosity is 10 3 to 10 6 poises. The working 
range for vitreous silica begins over 2000'C, so high that the practical useful- 
ness of the material is severely limited. In common glass, about 25 percent of 
Na a O is added as a network modifier to Si0 2 in order to reduce below 1000°C 
the temperature needed to make the glass fluid enough for the forming opera- 
tions needed to make electric lamp bulbs, window glass, and bottles. 


AMORPHOUS FERROMAGNETS 

Amorphous metallic alloys are formed by very' rapid quenching (cooling) 
of a liquid alloy, commonly by directing a molten stream of the alloy onto the 
surface of a rapidly' rotating drum. This process produces a continuous “melt- 
spun” ribbon of amorphous alloy in industrial quantities. 

Ferromagnetic amorphous alloys were developed because amorphous ma- 
terials have nearly isotropic properties, and isotropic materials should have es- 
sentially zero magnetocrystalline anisotropy energy'. The absence of directions 
of hard and easy magnetization should result in low coercivities, low hysteresis 
losses, and high permeabilities. Because amorphous alloy's are also random al- 
loys, their electrical resistivity is high. All these properties have technological 
value for application as soft magnetic materials. The trade name Metglas is at- 
tached to several of these. 
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The transition metal-metalloid (TM-M) alloys are an important class of 
magnetic amorphous alloys. The transition metal component is usually about 
80 percent of Fe, Co, or Ni, with the metalloid component B, C, Si, P, or Al. 
The presence of the metalloids lowers the melting point, making it possible to 
quench the alloy through the glass transition temperature rapidly enough to 
stabilize the amorphous phase. For example, the composition F e S0 B 2fl (known 
as Metglas 2605) has T g = 441°C, as compared with the melting temperature 
1538°C of pure iron. 

The Curie temperature of this composition in the amorphous phase is 
647 K, and the value of the magnetization M s at 300 K is 1257, compared with 
T c = 1043 K and M s = 1707 for pure iron. The coereivity is 0.04 G, and the 
maximum value of the permeability is 3 X 10 s . Coercivities as low as 0.006 G 
have been reported for another composition. 

High coereivity materials can be produced by the same melt-spin process 
if the spin rate or quench rate is decreased to produce a fine-grained crys- 
talline phase, which may be of metastable composition. If the grain size is 
arranged to match the optimum size for single domains, the coereivity can be 



Figure 6 Coereivity at room temperature vs. melt-spin velocity v t for Sm^Fe^. The maximum 
coereivity is 24 kG and occurs at 1.65 m s _1 , which is believed to correspond to single domain be- 
havior in each crystallite. At higher spin rates the coereivity decreases because the deposited ma- 
terial becomes amorphous (more isotropic). At lower spin rates the crystallites anneal to sizes 
above the single domain regime; domain boundaries give a lower coereivity'. (After J. L. Croat.) 
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quite high (Fig. 6). J. L. Croat has reported H ci = 7.5 kG for the metastable 
alloy Nd 0 4 Fe 0 6 at the optimum melt-spin velocity 5 m s 1 . 

AMORPHOUS SEMICONDUCTORS 

Amorphous semiconductors can be prepared as thin films by evaporation 
or sputtering, or in some materials as bulk glasses by supercooling the melt. 

What happens to the electron energy band model in a solid without regu- 
lar crystalline order? The Bloch theorem is not applicable when the structure 
is not periodic, so that the electron states cannot be described by well-defined 
k values. Thus, the momentum selection rule for optical transitions is relaxed; 
hence all infrared and Raman modes contribute to the absorption spectra. The 
optical absorption edge is rather featureless. Allowed bands and energy gaps 
still occur because the form of the density of states vs. energy is determined 
most strongly by local electron bonding configurations. 

Both electrons and holes can carry current in an amorphous semiconduc- 
tor. The carriers may be scattered strongly by the disordered structure, so that 
the mean free path may sometimes be of the order of the scale of the disorder. 
Anderson proposed that the states near band edges may be localized and do 
not extend through the solid (Fig. 7). Conduction in these states may take 
place by a thermally-assisted hopping process, for which the Hall effect is 
anomalous and cannot be used to determine the carrier concentration. 

Two distinct classes of amorphous semiconductors are widely studied: 
tetrahedrally-bonded amorphous solids such as silicon and germanium, and 
the chalcogenide glasses. The latter are multicomponent solids of which one 
major constituent is a “chalcogen” element — sulfur, selenium, or tellurium. 

The tetrahedrally-bonded materials have properties similar to those of 
their crystalline forms, provided the dangling-bond defects are compensated 
with hydrogen. They can be doped with small amounts of chemical impurities, 
and their conductivity can be sharply modified by injection of free carriers 
from a metallic contact. By contrast, the chalcogenide glasses are largely in- 
sensitive to chemical impurities and to free carrier injection. 

Amorphous hydrogenated silicon is a candidate material for solar cells. 
Amorphous silicon is a much less expensive material than single crystal silicon. 



Figure 7 Density of electron states 
as believed to occur in amorphous 
solids, when states are non-localized 
in the center of the band. Localized 
states are shown shaded. The mobil- 
ity band edges E c , E’ c separate the 
ranges of energy where states are 
localized and non-localized. (After 
N. Mott and E. A. Davis.) 
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Attempts at using pure amorphous silicon, however, failed because of struc- 
tural defects (dangling bonds) which were impossible to eliminate. Introduc- 
tion of hydrogen into amorphous silicon appears to remove the undesirable 
structural defects. Relatively large proportions of hydrogen are incorporated, 
of the order of 10 percent or more. 


LOW ENERGY EXCITATIONS IN AMORPHOUS SOLIDS 


The low temperature heat capacity 7 of pure dielectric crystalline solids is 
known (Chapter 5) to follow the Debye T A law, precisely as expected from the ex- 
citation of long wavelength phonons. The same behavior was expected in glasses 
and other amorphous solids. However, many insulating glasses show an unex- 
pected linear term in the heat capacity below 1 K. Indeed, at 25 mK the observed 
heat capacity of vitreous silica exceeds the Debye phonon contribution by a fac- 
tor of 1000. Anomalous linear terms of comparable magnitude are found in all, or 
nearly all, amorphous solids. Their presence is believed to be an intrinsic conse- 
quence of the amorphous states of matter, but the details of why this is so remain 
unclear. There is strong evidence that the anomalous properties arise from two- 
level systems and not from multi-level oscillator systems; in brief, the evidence is 
that the systems can be saturated by intense phonon fields, just as a two-level 
spin system can be saturated by an intense rf magnetic field. 


Heat Capacity Calculation 

Consider an amorphous solid with a concentration N of two-level systems 
at low energies; that is, with a level splitting A much less than the phonon 
Debye cutoff k B &. The partition function of one system is, with r = k B T, 

Z = exp(A/2r) + exp( — A/2t) = 2 cos1i(A/2t) . (20) 

The thermal average energy is 

U = — |A tanh(A/2r) , (21) 

and the heat capacity of the single system is 

Cy = k B (dU/dT) = k s (A/2Tf sech 2 (A/2r) . (22) 


These results are given in detail in TP, pp. 62-63. 

Now suppose that A is distributed with uniform probability in the range 
A = 0 to A = A 0 . The average value of Cy is 


(k B / 4t 2 ) f dA (A z /Ao)sech z (A/2r) 
J n 


/ 2k B r\ f Av/2r 

U7/ Jo 


dx x 2 seclfx 


( 23 ) 


The integral cannot be evaluated in closed form. 
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Two limits are of special interest. For t < A 0 , the sech 2 x term is roughly 1 
from x — 0 to x = I, and roughly zero for x > 1. The value of the integral is 
roughly 1/3, whence 

C v * 2&§T/3A 0 , (24) 


for T < Ay/fcjj. 

For t > A 0 , the value of the integral is roughly ^(k.J2k B T'f , so that in this 
limit 

Cy = Ag/12 k B T 2 , (25) 

which approaches zero as T increases. 

Thus the interesting region is at low temperatures, for here by (24) the 
two-level system contributes to the heat capacity a term linear in the tempera- 
ture. This term, originally introduced for dilute magnetic impurities in metals, 
has no connection with the usual conduction electron heat capacity which is 
also proportional to T. 

The empirical result appears to be that all disordered solids have 
N ~ 10 1 ' cm -3 “new type” low energy excitations uniformly distributed in the 
energy interval from 0 to 1 K. The anomalous specific heat can now be ob- 
tained from (24). For T — 0.1 K and A — 1 K, 

C v ~ §Mc s (0.1) 1 erg cm -3 KC 1 . (26) 

For comparison, the phonon contribution at 0.1 K is, from (5.35), 

C v «■ 2MNk B {T!B? « (234)(2.3 X ltf 2 )(\M X l(T 16 )(0.1/300) 3 
« 2.8 X 10 2 erg cm” 3 IT 1 , 

much smaller than (26). 

The experimental results (Fig. 8) for vitreous Si0 2 are represented by 

C v = C]T + c 3 T> , (28) 

where c l = 12 erg g _1 K~ 2 and c 3 = 18 erg g _1 K~ 4 . 

Thermal Conductivity 

The thermal conductivity of glasses is very low. It is limited at room tem- 
perature and above by the scale of the disorder of the structure, for this scale 
determines the mean free path of the dominant thermal phonons. At low tem- 
peratures, below 1 K, the conductivity is carried by long wavelength phonons 
and is limited by phonon scattering from the mysterious two-level systems or 
tunneling states discussed earlier for their contribution to the heat capacity of 
amorphous solids. 
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As in Chapter 5, the expression for the thermal conductivity K has the form 

K = 5 mi , (29) 

where c is the heat capacity per unit volume, v is an average phonon velocity, 
and f is the phonon mean free path. For vitreous silica at room temperature, 

K= 1.4 X 10 — 2 J cm" 1 s -1 K' 1 ; 

c - 1 .6 J cm 3 K 1 ; 

(u) = 4.2 X 10’ cm s _1 . 

Thus the mean free path t = 6 X 10 ~ 8 cm; by reference to Fig. 3 we see that 
this is of the order of magnitude of the disorder of the structure. 



Temperature, K 

Figure 8 Heat capacity of vitreous silica and soda silica glass as a function of temperature. The 
heat capacity is roughly linear in T below 1 K. The dashed line represents the calculated Debye 
heat capacity of vitreous silica. 
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Figure 9 Short phonon mean free path in a disordered structure. A short wavelength phonon 
that displaces atom L, as shown, will displace atom R by a much smaller distance, because of the 
phase cancellation of the upper and lower paths from L to R. The displacement of R is T + i - o, 
so that the wave incident from L is reflected at 11. 
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This value of the phonon mean free path is remarkably small. At room 
temperature and above (that is, above the Debye temperature), most of the 
phonons have half-wavelengths of the order of the interatomic spacing. It is 
through phase cancellation processes, as in Fig. 9, that the mean free path is 
limited to several interatomic spacings. No other structure for fused quartz 
will give a 6 A mean free path. The normal modes of vibration of the glass 
structure are utterly unlike plane waves. But the modes, as distorted as they 
are, still have quantized amplitudes and therefore may be called phonons. 

FIBER OPTICS 

Fibers of silica-based lightguides carry a high proportion of the data and 
information transmitted on the surface of the earth and under the seas. The 
optical fibers consist of a thin core (—10 /Ain) of high-refractive index glass 
surrounded by a cladding. The digital data are carried by light, with a mini- 
mum attenuation near 0.20 db km -1 at wavelengths near 1.55 /im . which is in 
the infrared (Fig. 10). A range of 100 km corresponds to a loss of 20 db, power 
readily supplied by an Eu 3+ laser amplifier. 

The optic window of high-purity glasses near this wavelength is limited on 
the low frequency side by phonon absorption bands and on the high frequency 
side by Rayleigh scattering, and, ultimately, by electronic absorption. In the 



Figure 10 The transmission characteristic of communication-quality optical fibers, showing the 
attenuation in units of decibels per km as a function of the wavelength of light, in (L m. The 
Rayleigh scattering regime is dominant on the left of the curve, except for a strong impurity ab- 
sorption line associated with OH ions that accompany Si0 2 ; the line is the second harmonic of a 
line at 2.7 /im known as the “water line.” The wavelength marked at 1.31 fim is used in 1994 trans- 
mission lines; it was replaced by the wavelength 1.55 fim available from Eu* + ion amplifiers, 
which arc used every 100 km in typical long distance applications. The power needed to pump the 
amplifiers is supplied by copper wires. (Courtesy of Tingyc Li, AT&T Bell Laboratories.) 
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optic window the losses are determined by the Rayleigh scattering intrinsic to 
static fluctuations in the local dielectric constant of an inhomogeneous 
medium, and the attenuation varies as the fourth power of the frequency. 

It is fortunate that an excellent source is available for radiation at 1.55 /xm. 
As shown in Fig. 13.24, excited (pumped) erbium Er 3+ ions can amplify in an 
erbium-doped section of fiber. 

Rayleigh Attenuation 

The attenuation of light waves in glass is dominated at wavelengths in the 
infrared by the same scattering process, called Rayleigh scattering, that is re- 
sponsible for the blue light of the sky. The extinction coefficient, or attenua- 
tion coefficient, h, has the dimension of reciprocal length and for light scat- 
tered in a gas is given, after Rayleigh, by 

h = (2to i /3TTc 4 N)((n - l) 2 ) , (30) 

where n is the local refractive index and N is the number of scattering centers 
per unit volume. The energy flux as a function of distance has the form 
exp(— hx). 

Derivations of (30) are found in good texts on electrodynamics; the struc- 
ture of the result may be understood by a general argument: The radiant en- 
ergy scattered from a dipole element p is proportional to (. dp 2 /dt 2 ) 2 , and this 
accounts for the factor oj 4 . The local polarizability a enters as a 2 ; if there are N 
random scattering centers per unit volume, the scattered energy averaged over 
these random sources will go as N{( Aa) 2 ), or ((A n) 2 )/N. Thus we have the es- 
sential factors that appear in (30). As applied to a glass. An should refer to the 
variations in polarization around each group of Si — O bonds, and satisfactory 7 
numerical estimates of the attenuation may be made in this way. 

Problem 

1. Metallic optic fibres? It has been speculated that metallic wires can act as optic 
fibres, transmitting light with a long delay appropriate to the high refractive index 
characteristic of metals. Unfortunately the refractive index of a typical metal is 
dominated by a free-electron term in i m , so that the propagation of a light wave is in 
fact highly damped in a metal. Show that in sodium at room temperature a wave of 
vacuum wavelength 10 pm will have a damping length of 0.1 /am. This may be con- 
trasted with the 100 km damping length found for light in high-quality glass fibres. 
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Figure 1 A plane of a pure alkali halide crystal, showing two vacant positive ion sites, and a 
coupled pair of vacant sites of opposite sign. 
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CHAPTER 20: POINT DEFECTS 


The common point imperfections in crystals are chemical impurities, 
vacant lattice sites, and extra atoms not in regular lattice positions. Linear im- 
perfections are treated under dislocations, Chapter 21. The crystal surface is a 
planar imperfection, with electron, phonon, and magnon surface states. 

Some important properties of crystals are controlled as much by imperfec- 
tions as by the composition of the host crystal, which may act only as a solvent 
or matrix or vehicle for the imperfections. The conductivity of some semicon- 
ductors is due entirely to trace amounts of chemical impurities. The color and 
luminescence of many crystals arise from impurities or imperfections. Atomic 
diffusion may be accelerated enormously by impurities or imperfections. 
Mechanical and plastic properties are usually controlled by imperfections. 


LATTICE VACANCIES 

The simplest imperfection is a lattice vacancy, which is a missing atom or 
ion, also known as a Schottky defect. (Fig. 1). We create a Schottky defect in 
a perfect crystal by transferring an atom from a lattice site in the interior to a 
lattice site on the surface of the crystal. In thermal equilibrium a certain num- 
ber of lattice vacancies are always present in an otherwise perfect crystal, be- 
cause the entropy is increased by the presence of disorder in the structure. 

In metals with close-packed structures the proportion of lattice sites va- 
cant at temperatures just below the melting point is of the order of 1 0 3 to 
10 -4 . But in some alloys, in particular the very hard transition metal carbides 
such as TiC, the proportion of vacant sites of one component can be as high as 
50 percent. 

The probability that a given site is vacant is proportional to the Boltzmann 
factor for thermal equilibrium: P = exp(— E v /k s T), where F. v is the energy re- 
quired to take an atom from a lattice site inside the crystal to a lattice site on 
the surface. If there are N atoms, the equilibrium number n of vacancies is 
given by the Boltzmann factor 

= exp (~E v /k B T) . (1) 

If n N, then 

n/N = exp(— E v /k B T) . (2) 

If E v ~ 1 eV and T ~ 1000 K, then n/N « e~ lz - 10“ 5 . 

The equilibrium concentration of vacancies decreases as the tempera- 
ture decreases. The actual concentration of vacancies will be higher than the 
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Figure 2 Schottky and Frenkel defects in an ionic crystal. The arrows indicate the displacement 
of the ions. In a Schottky defect the ion ends up on the surface of the crystal; in a Frenkel defect 
the ion is removed to an interstitial position. 


equilibrium value if the crystal is grown at an elevated temperature and then 
cooled suddenly, thereby freezing in the vacancies (see the discussion of diffu- 
sion below). 

In ionic crystals it is usually energetically favorable to form roughly equal 
numbers of positive and negative ion vacancies. The formation of pairs of va- 
cancies keeps the crystal electrostatically neutral on a local scale. From a sta- 
tistical calculation we obtain 

n=N exp(—E p /2k B T) (3) 

for the number of pairs, where E p is the energy of formation of a pair. 

Another vacancy defect is the Frenkel defect (Fig. 2) in which an atom is 
transferred from a lattice site to an interstitial position, a position not nor- 
mally occupied by an atom. In pure alkali halides the most common lattice va- 
cancies are Schottky defects; in pure silver halides the most common vacancies 
are Frenkel defects. The calculation of the equilibrium number of Frenkel 
defects proceeds along the lines of Problem 1. If the number n of Frenkel 
defects is much smaller than the number of lattice sites N and the number of 
interstitial sites N', the result is 

n = (NN') y2 exp(—E,/2k B T) , (4) 

where Ej is the energy necessary to remove an atom from a lattice site to an 
interstitial position. 

Lattice vacancies are present in alkali halides when these contain addi- 
tions of divalent elements. If a crystal of KCl is grown with controlled amounts 
of CaCl 2 , the density varies as if a K + lattice vacancy were formed for each 
Ca 2+ ion in the crystal. The Ca 2+ enters the lattice in a normal K + site and the 
two Cl - ions enter two Cl - sites in the KCl crystal (Fig. 3). Demands of charge 
neutrality result in a vacant metal ion site. The experimental results show that 
the addition of CaCl 2 to KCl lowers the density of the crystal. The density 
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Figure 3 Production of a lattice vacancy by the solution of CaCl 2 in KC1: to ensure electrical 
neutrality, a positive ion vacancy is introduced into the lattice with each divalent cation Ca ++ . The 
two Cl" ions of CaCl 2 enter normal negative ion sites. 
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(a) (b) (c) 


Figure 4 Three basic mechanisms of diffusion: (a) Interchange by rotation about a midpoint. 
More than two atoms may rotate together, (b) Migration through interstitial sites, (c) Atoms 
exchange position with vacant lattice sites. (After Seitz.) 


would increase if no vacancies were produced, because Ca 2+ is a heavier and 
smaller ion than K + . 

The mechanism of electrical conductivity in alkali and silver halide ciys- 
tals is usually by the motion of ions and not by the motion of electrons. This 
has been established by comparing the transport of charge with the transport 
of mass as measured by the material plated out on electrodes in contact with 
the crystal. 

The study of ionic conductivity is an important tool in the investigation of 
lattice defects. Work on alkali and silver halides containing known additions of 
divalent metal ions shows that at not too high temperatures the ionic conduc- 
tivity is directly proportional to the amount of divalent addition. This is not be- 
cause the divalent ions are intrinsically highly mobile, for it is predominantly 
the monovalent metal ion which deposits at the cathode. The lattice vacancies 
introduced with the divalent ions are responsible for the enhanced diffusion 
(Fig. 4c). The diffusion of a vacancy in one direction is equivalent to the diffu- 
sion of an atom in the opposite direction. When lattice defects are generated 
thermally, their energy of formation gives an extra contribution to the heat 
capacity of the crystal, as shown in Fig. 5. 
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Figure 5 Heat capacity of silver bromide exhibiting the excess heat capacity from the formation 
of lattice defects. (After R. W. Christy and A. W. Lawson.) 

An associated pair of vacancies of opposite sign exhibits an electric dipole 
moment, with contributions to the dielectric constant and dielectric loss due to 
the motion of pairs of vacancies. The dielectric relaxation time is a measure of 
the time required for one of the vacant sites to jump by one atomic position 
with respect to the other. The dipole moment can change at low frequencies, 
but not at high. In sodium chloride the relaxation frequency is 1000 s ~ 1 at 85°C. 


DIFFUSION 

When there is a concentration gradient of impurity atoms or vacancies in a 
solid, there will be a flux of these through the solid. In equilibrium the impuri- 
ties or vacancies wall he distributed uniformly. The net flux / v of atoms of one 
species in a solid is related to the gradient of the concentration N of this 
species by a phenomenological relation called Fick’s law: 

/v = -D grad .V . (5) 

Here J N is the number of atoms crossing unit area in unit time; the constant D 
is the diffusion constant or diffusivity and has the units cm 2 /s or m 2 /s. The 
minus sign means that diffusion occurs away from regions of high concentra- 
tion. The form (5) of the law of diffusion is often adequate, but rigorously the 
gradient of the chemical potential is the driving force for diffusion and not the 
concentration gradient alone ( TF , p. 406). 

The diffusion constant is often found to vary with temperature as 

D = D 0 exp(-E/k e T) ; (6) 

here E is the activation energy for the process. Experimental results on the 
diffusion of carbon in alpha iron are shown in Fig. 6. The data are represented 
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Figure 6 Diffusion coefficient of carbon in iron, after Wert. The logarithm of D is directly 
proportional to 1/T. 


Table 1 Diffusion constants and activation energies 


Host 

crystal 

Atom 

D 0 

cm 2 s 1 

E 

eV 

Host 

crystal 

Atom 

Do 

cm 2 s 1 

E 

eV 

Cu 

Cu 

0.20 

2.04 

Si 

A1 

8,0 

3.47 

Cu 

Zn 

0.34 

1.98 

Si 

Ca 

3.6 

3.51 

A g 

Ag 

0.40 

1.91 

Si 

In 

16.0 

3.90 

A g 

Cu 

1.2 

2.00 

Si 

As 

0.32 

3.56 

A g 

Au 

0.26 

1.98 

Si 

Sb 

5.6 

3.94 

A g 

Pb 

0.22 

1.65 

Si 

Li 

2 X 10~ 3 

0.66 

Na 

Na 

0.24 

0.45 

Si 

Au 

1 X 10" 3 

1.13 

U 

U 

2 X ]0 _) 

1.20 

Ge 

Ce 

10.0 

3.1 


by E = 0.87 cV D 0 = 0.020 cm 2 /s. Representative values of D 0 and E are given 
in Table 1. 

To diffuse, an atom must overcome the potential energy barrier presented 
by its nearest neighbors. We treat the diffusion of impurity atoms between 
interstitial sites. The same argument will apply to the diffusion of vacant 
lattice sites. If the barrier is of height E, the atom will have sufficient thermal 
energy to pass over the harrier a fraction exp(— E/k B T) of the time. Quantum 
tunneling through the barrier is another possible process, hut is usually impor- 
tant only for the lightest nuclei, particularly hydrogen. 


590 


If v is a characteristic atomic vibrational frequency, then the probability p 
that during unit time the atom will have enough thermal energy to pass over 
the barrier is 


p~vexp(-E/k B D ■ (7) 

In unit time the atom makes v passes at the barrier, with a probability 
exp (~E/k B T) of surmounting the barrier on each try. The quantity' p is called 
the jump frequ ency. 

We consider two parallel planes of impurity atoms in interstitial sites. The 
planes are separated by lattice constant a. There are S impurity atoms on one 
plane and (S + a dS/dx ) on the other. The net number of atoms crossing be- 
tween the planes in unit time is ~ —pa dS/dx. If N is the total concentration of 
impurity atoms, then S = aN per unit area of a plane. 

The diffusion flux may now be written as 

Jn s= ~pa 2 (dN/dx) . (8) 


On comparison with (5) wc have the result 

D = va 2 exp(— E/k B T) . 


(9) 


of the fonn (6) with D 0 — va 2 . 

If the impurities are charged, we may find the ionic mobility p. and the 
conductivity cr from the diffusivity by using the Einstein relation k B Tp — qD 
from TP , p. 406: 


p = ( qva 2 /k e T ) exp {—E/k B T) ; 


( 10 ) 


cr = Nqp — ( Nq 2 va 2 /k B T ) exp (~E/k B T) , (11) 

where N is the concentration of impurity ions of charge q. 

The proportion of vacancies is independent of temperature in the range in 
which the number of vacancies is determined by the number of divalent metal 
ions. Then the slope of a plot of In (r versus 1 /k B T gives E + , the barrier activa- 
tion energy' for the jumping of positive ion vacancies (Table 2). Diffusion is 
very slow at low temperatures. At room temperature the jump frequency is of 
the order of Is -1 , and at 100 K it is of the order of 10 -2j s _1 . 

The proportion of vacancies in the temperature range in which the con- 
centration of defects is determined by thermal generation is given by 

/ s exp(—E f /2k B T) , (12) 

where Ey is the energy of formation of a vacancy pair, according to the theory 
of Schottky or Frenkel defects. Here the slope of a plot of In cr versus 1 /k B T 
will be E + + \Ef, according to (10) and (12). From measurements in different 
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Table 2 Activation energy E + for motion of a positive ion vacancy 


Values of the energy of formation of a vacancy pair, Ep are also given. The 
numbers given in parentheses for the silver salts refer to interstitial silver ions. 


Crystal 

£ + (eV) 

Ej{e\’) 

Workers 

NaCl 

0.86 

2.02 

Etzcl and Maurer 

LiF 

0.65 

2.68 

Haven 

LiCI 

0.41 

2.12 

Haven 

LiBr 

0.31 

1.80 

Haven 

Lil 

0.38 

1.34 

Haven 

KCl 

0.89 

2. 1-2. 4 

Wagner; Kelting and Witt 

AgCl 

0.39(0.10) 

1.4“ 

Teltow 

AgBr 

0.25(0.11) 

i.r 

Compton 


“For Frenkel defect. 


temperature ranges we determine the energy of formation of a vacancy pair E f 
and the jump activation energy E + . 

The diffusion constant can he measured by radioactive tracer techniques. 
The diffusion of a known initial distribution of radioactive ions is followed as 
a function of time or distance. Values of the diffusion constant thus deter- 
mined may be compared with values from ionic conductivities. The two sets of 
values do not usually agree within the experimental accuracy, suggesting the 
presence of a diffusion mechanism that does nol involve the transport of 
charge. For example, the diffusion of pairs of positive and negative ion vacan- 
cies does not involve the transport of charge. 

Metals 

Self-diffusion in monatomic metals most commonly proceeds by lattice va- 
cancies. Self-diffusion means the diffusion of atoms of the metal itself, and 
not of impurities. The activation energy’ for self-diffusion in copper is expected 
to be in the range 2.4 to 2.7 eV for diffusion through vacancies and 5.1 to 
6.4 eV for diffusion through interstitial sites. Observed values of the activation 
energy are 1.7 to 2.1 eV. 

Activation energies for diffusion in Li and Na can be determined from 
measurements of the temperature dependence of the nuclear resonance line 
width. As discussed under magnetic resonance, the resonance line width 
narrows when the jump frequency of an atom between sites becomes rapid in 
comparison with the frequency corresponding to the static line width. The val- 
ues 0.57 eV and 0.45 eV were determined by NMR for Li and Na. Self-diffu- 
sion measurements for sodium also give 0.4 cV. 
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COLOR CENTERS 

Pure alkali halide crystals are transparent throughout the visible region 
of the spectrum. A color center is a lattice defect that absorbs visible light. 
An ordinary lattice vacancy does not color alkali halide crystals, although it af- 
fects the absorption in the ultraviolet. The crystals may be colored in a number 
of ways: 

• by the introduction of chemical impurities; 

• by the introduction of an excess of the metal ion (we may heat the crystal in 
the vapor of the alkali metal and then cool it quickly — an NaCl crystal heated 
in the presence of sodium vapor becomes yellow; a KC1 crystal heated in 
potassium vapor becomes magenta); 

• by x-ray, y-ray, neutron, and electron bombardment; and 

• by electrolysis. 

F Centers 

The name F center comes from the German word for color, Farbe. We 
usually produce F centers by heating the crystal in excess alkali vapor or by 
x-irradiation. The central absorption band (F band) associated with F centers 
in several alkali halides are shown in Fig. 7, and the quantum energies are 
listed in Table 3. Experimental properties of F centers have been investigated 
in detail, originally by Pohl. 

The F center has been identified by electron spin resonance as an electron 
bound at a negative ion vacancy (Fig. 8), in agreement with a model suggested 
by de Boer. When excess alkali atoms are added to an alkali halide crystal, a 
corresponding number of negative ion vacancies are created. The valence 
electron of the alkali atom is not bound to the atom; the electron migrates in 
the crystal and becomes bound to a vacant negative ion site. A negative ion va- 
cancy in a perfect periodic lattice has the effect of an isolated positive charge: 
it attracts and binds an electron. We can simulate the electrostatic effect of 


Wavelength in A 

4000 6000 4000 6000 4000 6000 4000 6000 4000 6000 


o 



4 3 24 3 24 3 24 3 24 3 2 


Energy in eV 

Figure 7 The F bands for several alkali halides: optical absorption versus wavelength for crystals 
that contain F centers. 
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Table 3 Experimental F center absorption 
energies, in eV 


LiCl 

3.1 

NaBr 

2.3 

NaCl 

2.7 

KBr 

2.0 

KCI 

2.2 

RbBr 

1.8 

RbCl 

2.0 

LiF 

5.0 

CsCl 

2.0 

NaF 

3.6 

LiBr 

2.7 

KF 

2.7 



Figure 8 An F center is a negative ion 
vacancy with one excess electron bound at 
the vacancy. The distribution of the excess 
electron is largely on the positive metal ions 
adjacent to the vacant lattice site. 






Figure 9 An F A center in KCI: one of the six K + ions 
which bind an F center is replaced by another alkali ion, 
here Na + . 


a negative ion vacancy by adding a positive charge q to the normal charge —q 
of an occupied negative ion site. 

The F center is the simplest trapped-electron center in alkali halide crys- 
tals. The optical absorption of an F center arises from an electric dipole transi- 
tion to a bound excited state of the center. 

Other Centers in Alkali Halides 

In the F a center one of the six nearest neighbors of an F center has been 
replaced by a different alkali ion. Fig. 9. More complex trapped-electron cen- 
ters are formed by groups of F centers. Fig. 10 and 11. Thus two adjacent 
F centers form an M center. Three adjacent F centers form an R center. Dif- 
ferent centers are distinguished by their optical absorption frequencies. 
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Figure 10 An M center consists of two 
adjacent F centers. 



Figure 11 An R center consists of 
three adjacent F centers; that is, a 
group of three negative ion vacancies 
in a [111] plane of the NaCl struc- 
ture, with three associated electrons. 


- - 0 - ~ . 

- © o' - - 


Cl- 

K* 
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K* 
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ft 


ci- 

K* 

ci- 

K + 

Cl- 

ci- 

O 
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K * Y' 

2 

K + 
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Cl“ 

K* 



ci~ 

Cl- 

K + 

ci- 

K* 

ci- 

K + 


Figure 12 A V K center formed when a hole is trapped by a pair of negative ions resembles a neg- 
ative halogen molecule ion, which is CI 2 in KCl. No lattice vacancies or extra atoms are involved in 
a V K center. The center at the left of the figure probably is not stable: the hexagon represents a 
hole trapped near a positive ion vacancy; such a center would be the antimorph to an F center. 
Holes have a lower energy trapped in a V K center than in an anti-F center. 



20 Point Defects 


595 


Holes may be trapped to form color centers, but hole centers are not usu- 
ally as simple as electron centers. For example, a hole in the filled p 6 shell of a 
halogen ion leaves the ion in a p D configuration, whereas an electron added to 
the filled p 6 shell of an alkali ion leaves the ion in a p 6 s configuration. 

The chemistry of the two centers is different: p 6 s acts as a spherically sym- 
metric ion, but p 5 acts as an asymmetric ion and, by virtue of the Jahn-Teller 
effect, will distort its immediate surroundings in the crystal. 

The antimorph to the F center is a hole trapped at a positive ion vacancy, 
but no such center has been identified experimentally in alkali halides; in 
insulating oxides the O - (called V - ) defect is known. The best-known trapped- 
hole center is the V K center. Fig. 12. The V K center is formed when a hole is 
trapped by a halogen ion in an alkali halide crystal. Electron spin resonance 
shows that the center is like a negative halogen molecular ion, such as CI 2 in 
KCl. The Jahn-Teller trapping of free holes is the most effective form of self- 
trapping of charge carriers in perfect crystals. 

Problems 

1. Frenkel defects. Show that the number n of interstitial atoms in equilibrium with n 
lattice vacancies in a crystal having N lattice points and N' possible interstitial posi- 
tions is given by the equation 

E, = k B T In [( N - n)(N' - n)/n 2 3 ] , 

whence, for n <€ N, N', we have n — (NN') m exp(— E,/2k B T). Here E, is the energy 
necessary to remove an atom from a lattice site to an interstitial position. 

2. Schottky vacancies. Suppose that the energy required to remove a sodium atom 
from the inside of a sodium crystal to the boundary is 1 eV. Calculate the concentra- 
tion of Schottky vacancies at 300 K. 

3. F center, (a) Treat an F center as a free electron of mass in moving in the field of 
a point charge e in a medium of dielectric constant e = n 2 ; what is the ls-2p energy 
difference of F centers in NaCl? (b) Compare from Table 3 the F center excitation 
energy in NaCl with the 3s-3p energy difference of the free sodium atom. 
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Figure 1 (a) Relative shear of two planes of atoms (shown in section) in a uniformly strained crys- 
tal; (b) shear stress as a function of the relative displacement of the planes from their equilibrium 
position. The heavy broken line drawn at the initial slope defines the shear modulus G. 



CHAPTER 21 : DISLOCATIONS 


This chapter is concerned with the interpretation of the plastic mechani- 
cal properties of crystalline solids in terms oi the theory of dislocations. Plastic 
properties are irreversible deformations; elastic properties are reversible. The 
ease with which pure single crystals deform plastically is striking. This intrinsic 
weakness of crystals is exhibited in various ways. Pure silver chloride melts at 
455°C, yet at room temperature it has a cheeselike consistency and can be 
rolled into sheets. Pure aluminum crystals are elastic (follow Hooke s law) only 
to a strain of about 10 after which they deform plastically. 

Theoretical estimates of the strain at the elastic limit of perfect crystals 
may give values 10 s or 10 4 higher than the lowest observed values, although a 
factor 10 2 is more usual. There are few exceptions to the rule that pure crystals 
are plastic and are not strong: crystals of germanium and silicon are not plastic 
at room temperature and fail or yield only by fracture. Glass at room tempera- 
ture fails only by fracture, but it is not crystalline. The fracture ol glass is 
caused by stress concentration at minute cracks. 


SHEAR STRENGTH OF SINGLE CRYSTALS 

Frenkel gave a simple method of estimating the theoretical shear strength 
of a perfect crystal. We consider in Fig. 1 the force needed to make a shear 
displacement of two planes of atoms past each other. For small elastic strains, 
the stress cr is related to the displacement x by 

cr = Gx/d . (1) 

Here d is the interplanar spacing, and G denotes the appropriate shear 
modulus. When the displacement is large and has proceeded to the point that 
atom A is directly over atom B in the figure, the two planes of atoms are in a 
configuration of unstable equilibrium and the stress is zero. As a first approxi- 
mation we represent the stress-displacement relation by 

a = (Ga/2ird) sin (2m/a) , (2) 

where a is the interatomic spacing in the direction of shear. This relation is con- 
structed to reduce to (1) for small values of x/a. The critical shear stress a c at 
which the lattice becomes unstable is given by the maximum value of cr, or 

cr,. = Ga/2vd . (3) 

If a = d, then cr c ~ G/2ir: the ideal critical shear stress is of the order of g of 
the shear modulus. 


599 


600 


Table 1 Comparison of shear modulus G and observed elastic limit <r c " 


Shear modulus G, Elastic limit cr t ,, 

iu dyii/cnr in livn/cin G/<T C 


Sn, single crystal 

1.9 

RiSSie 

X 

10 11 

1.3 

X 

10 7 

15,000 

Ag, single crystal 

2.8 

X 

10 11 

6 

X 

TO 6 

45,000 

Al, single crystal 

2.5 

X 

10 11 

4 

X 

SO 

o 

H 

60,000 

Al, pure, polycrystal 

2.5 

X 

10 11 

2.6 

X 

10 8 

900 

Al, commercial drawn 

—2.5 

X 

10 u 

9.9 

X 

I0 8 

250 

Duralumin 

—2.5 

X 

10 11 

3.6 

X 

10® 

70 

Fe, soft, polycrystal 

7.7 

X 

10 11 

1.5 

X 

10® 

500 

Heat-treated carbon steel 

-8 

X 

10 l[ 

6.5 

X 

10® 

120 

Nickel-chrome steel 

-8 

X 

10 11 

1.2 

X 

10 1 ® 

65 


’After Mott 


The observations in Table 1 show the experimental values of the elastic 
limit are much smaller than (3) would suggest. The theoretical estimate may 
be improved by consideration of the actual form of the intermolecular forces 
and by consideration of other configurations of mechanical stability available 
to the lattice as it is sheared. Mackenzie has shown that these two effects may 
reduce the theoretical ideal shear strength to about G/30, corresponding to a 
critical shear strain angle of about 2 degrees. The observed low values of the 
shear strength can be explained only by the presence of imperfections that 
can act as sources of mechanical weakness in real crystals . The movement of 
crystal imperfections called dislocations is responsible for slip at very low 
applied stresses. 

Slip 

Plastic deformation in crystals occurs by slip, an example of which is 
shown in Fig. 2. In slip, one part of the crystal slides as a unit across an adja- 
cent part. The surface on which slip takes place is known as the shp plane. The 
direction of motion is known as the slip direction. The great importance of lat- 
tice properties for plastic strain is indicated by the highly anisotropic nature of 
slip. Displacement takes place along crystallographic planes with a set of small 
Miller indices, such as the {111} planes in fee metals and the (HO), {112}, and 
{123} planes in bcc metals. 

The slip direction is in the line of closest atomic packing, (110) in fee 
metals and (111) in bcc metals (Problem 1). To maintain the crystal structure 
after slip, the displacement or slip vector must equal a lattice translation vec- 
tor. The shortest lattice translation vector expressed in terms of the lattice con- 
stant a in a fee structure is of the form (a/2)(x + y); in a bcc structure it is 
(a/ 2)(x + y + z). But in fee crystals one also observes partial displacements 
which upset the regular sequence ABCABC ... of closest-packed planes, to 
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Figure 2 Translational slip in zinc single crystals. (E. R. Parker.) 


produce a stacking fault such as ABC AB ABC. . . . The result is then a mix- 
ture of fee and hep stacking. 

Deformation by slip is inhomogeneous: large shear displacements occur 
on a few widely separated slip planes, while parts of the crystal lying between 
slip planes remain essentially undeformed. A property of slip is the Schmid law 
of the critical shear stress: slip takes place along a given slip plane and direction 
when the corresponding component of shear stress reaches the critical value. 

Slip is one mode of plastic deformation. Another mode, twinning, is ob- 
served particularly in hep and bcc structures. During slip a considerable dis- 
placement occurs on a few widely separated slip planes. During twinning, a 
partial displacement occurs successively on each of many neighboring crystal- 
lographic planes. After twinning, the deformed part of the crystal is a mirror 
image of the undeformed part. Although both slip and twinning are caused by 
the motion of dislocations, we shall be concerned primarily with slip. 


DISLOCATIONS 

The low observed values of the critical shear stress are explained in terms 
of the motion through the lattice of a line imperfection known as a dislocation. 
The idea that slip propagates by the motion of dislocations was published in 
1934 independently by Taylor, Orowan, and Polanyi; the concept of dislocations 
was introduced somewhat earlier by Prandtl and Dehlinger. There are several 
basic types of dislocations. We first describe an edge dislocation. Figure 3 
shows a simple cubic crystal in which slip of one atom distance has occurred 
over the left half of the slip plane but not over the right half. The boundary be- 
tween the slipped and unslipped regions is called the dislocation. Its position is 
marked by the termination of an extra vertical half-plane of atoms crowded into 
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Figure 3 An edge dislocation EF in the glide plane ABCD. The figure shows the slipped region 
ABEF in which the atoms have been displaced by more than half a lattice constant and the un- 
slipped region FECD with displacement less than half a lattice constant. 


Figure 4 Structure of an edge disloca- 
tion. The deformation may be thought 
of as caused by inserting an extra plane 
of atoms on the upper half of the y axis. 
Atoms in the upper half-crystal are 
compressed by the insertion; those in 
the lower half are extended. 



the upper half of the crystal as shown in Fig. 4. Near the dislocation the crystal 
is highly strained. The simple edge dislocation extends indefinitely in the slip 
plane in a direction normal to the slip direction. In Fig. 5 we show a photo- 
graph of a dislocation in a two-dimensional soap bubble raft obtained by the 
method of Bragg and Nye. 

The mechanism responsible for the mobility of a dislocation is shown in 
Fig. 6. The motion of an edge dislocation through a crystal is analogous to the 
passage of a ruck or wrinkle across a rug; the ruck moves more easily than the 
whole rug. If atoms on one side of the slip plane are moved with respect to 
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Figure 5 A dislocation in a two-dimensional bubble raft. The dislocation is most easily seen by 
turning the page by 30° in its plane and sighting at a low angle. (W. M. Lomer, after Bragg and Nye.) 



Figure 6 Motion of a dislocation 
under a shear tending to move the 
upper surface of the specimen to the 
right. (D. Hull.) 


those on the other side, atoms at the slip plane will experience repulsive forces 
from some neighbors and attractive forces from others across the slip plane. 
These forces cancel to a first approximation. The external stress required to 
move a dislocation has been calculated and is quite small, below 10 5 dyn/cm 2 
when the bonding forces in the crystal are not highly directional. Thus dis- 
locations may make a crystal very plastic. Passage of a dislocation through a 
crystal is equivalent to a slip displacement of one part of the crystal. 

The second simple type of dislocation is the screw dislocation, sketched 
in Figs. 7 and 8. A screw dislocation marks the boundary between slipped and 
unslipped parts of the crystal. The boundary parallels the slip direction, in- 
stead of lying perpendicular to it as for the edge dislocation. The screw dis- 
location may be thought of as produced by cutting the crystal partway through 
with a knife and shearing it parallel to the edge of the cut by one atom spacing. 
A screw dislocation transforms successive atom planes into the surface of a 
helix; this accounts for the name of the dislocation. 
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Figure 7 A screw dislocation. A part ABEF of the slip plane has slipped in the direction parallel to 
the dislocation line EF. A screw dislocation may be visualized as a helical arrangement of lattice 
planes, such that we change planes on going completely around the dislocation line. (After Cottrell.) 


Figure 8 Another view of a screw dislocation. The bro- 
ken vertical line that marks the dislocation is surrounded 
by strained material. 

Burgers Vectors 

Other dislocation forms may be constructed from segments of edge and 
screw dislocations. Burgers has shown that the most general form of a linear 
dislocation pattern in a crystal can be described as shown in Fig. 9. We con- 
sider any closed curve within a crystal, or an open curve terminating on the 
surface at both ends: (a) Make a cut along any simple surface bounded by the 
line, (b) Displace the material on one side of this surface by a vector b relative 
to the other side; here b is called the Burgers vector, (c) In regions where b 
is not parallel to the cut surface, this relative displacement will either produce 
a gap or cause the two halves to overlap. In these cases we imagine that we 
either add material to fill the gap or subtract material to prevent overlap, (d) 
Rejoin the material on both sides. We leave the strain displacement intact at 
the time of rewelding, but afterwards we allow the medium to come to internal 
equilibrium. The resulting strain pattern is that of the dislocation character- 
ized jointly by the boundary curve and the Burgers vector. The Burgers vector 
must be equal to a lattice vector in order that the rewelding process will main- 
tain the crystallinity of the material. The Burgers vector of a screw dislocation 
(Figs. 7 and 8) is parallel to the dislocation line; that of an edge dislocation 
(Figs. 3 and 4) is perpendicular to the dislocation line and lies in the slip plane. 
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Figure 9 General method of forming a dislocation ring in a medium. The medium is represented 
by the rectangular block. The ring is represented by the closed curve in the interior in the block. A 
cut is made along the surface bounded by the curve and indicated by the contoured area. The ma- 
terial on one side of the cut is displaced relative to that on the other by vector distance b, which 
may be oriented arbitrarily relative to the surface. Forces will be required to effect the displace- 
ment. The medium is filled in or cut away so as to be continuous after the displacement. It is then 
joined in the displaced state and the applied forces are relaxed. Here b is the Burgers vector of the 
dislocation. (After Seitz.) 



Figure 10 Shell of elastically distorted crystal surrounding 
screw dislocation with Burgers vector b; see also Fig. 16. 


Stress Fields of Dislocations 

The stress field of a screw dislocation is particularly simple. Figure 10 
shows a shell of material surrounding an axial screw dislocation. The shell of 
circumference 2ttv has been sheared by an amount b to give a shear strain 
e = fe/2iTr The corresponding shear stress in an elastic continuum is 


a — Ge = Gb/'lirr . 


(4) 
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This expression does not hold in the region immediately around the disloca- 
tion line, as the strains here are too large for continuum or linear elasticity 
theory to apply. The elastic energy of the shell is d.E, = | Ge 2 dV = (Gh 2 /4Tr) dr/r 
per unit length. The total elastic energy per unit length of a screw dislocation 
is found on integration to be 


jt Gb 2 , R 

E s a — in — 

5 477 r 0 


(5) 


where Ft. and r 0 are appropriate upper and lower limits for the variable r. A rea- 
sonable value of r 0 is comparable to the magnitude b of the Burgers vector 
or to the lattice constant; the value of R cannot exceed the dimensions of the 
crystal. The value of the ratio R/r, ; is not very important because it enters in a 
logarithm term. 

We now show the form of the energy' of an edge dislocation. Let <T n . and 
a m denote the tensile stresses in the radial and circumferential directions, and 
let <r r0 denote the shear stress. In an isotropic elastic continuum, a n . and cr w 
are proportional to (sin 0)/r: wc need a function that falls off as 1/r and that 
changes sign when y is replaced by —j. The shear stress cr re is proportional to 
(cos 9)/r; considering the plane y = 0, we see from Fig. 4 that the shear stress 
is an odd function of x. The constants of proportionality in the stress are propor- 
tional to the shear modulus G and to the Burgers vector b of the displacement. 
The final result is 


<T, r — (Toa — " 


Gb sinfl 
2it(1 — v ) r 


a r9 : 


Gb cos 6 
2tt( 1 — v) r 


( 6 ) 


where the Poisson ratio v ~ 0.3 for most crystals. The strain energy' of a unit 
length of edge dislocation is 


Gb 2 

4it(1 —v) 



(7) 


We want an expression for the shear stress component <j xy on planes paral- 
lel to the slip plane in Fig. 4. From the stress components cr^. cr m , and <i rU 
evaluated on the plane a distance y above the slip plane, we find 


Gb ~ '/) 

277(1— v)‘ (x 2 + y 2 ) 2 ‘ 


( 8 ) 


It is shown in Problem 3 that the force caused by a resolved uniform shear 
stress rr is F = ba per unit length of dislocation. The force that an edge dislo- 
cation at the origin exerts upon a similar one at the location (y, 8) is 

„ i Gb 2 sin 46 

F 2*1 -v) 4 ,j 


per unit length. Here F is the component of force in the slip direction. 


( 9 ) 
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Low-angle Grain Boundaries 

Burgers suggested that low-angle boundaries between adjoining crystal- 
lites or crystal grains consist of arrays of dislocations. A simple example of the 
Burgers model of a grain boundary is shown in Fig. 11. The boundary occupies 
a (010) plane in a simple cubic lattice and divides two parts of the crystal that 
have a [001] axis in common. Such a boundary is called a pure tilt boundary: 
the misorientation can be described by a small rotation 6 about the common 
[001] axis of one part of the crystal relative to the other. The tilt boundary is 
represented as an array of edge dislocations of spacing D = b/6, where b is the 
Burgers vector of the dislocations. Experiments have substantiated this model. 
Figure 12 shows the distribution of dislocations along small-angle grain 
boundaries, as observed with an electron microscope. Further, Read and 
Shockley derived a theory of the interfacial energy as a function of the angle of 
tilt, with results in excellent agreement with measurements. 

Direct verification of the Burgers model is provided by the quantitative 
x-ray and optical studies of low-angle boundaries in germanium crystals by Vogel 



(b) 


Figure 11 (a) Low-angle grain boundary, after Burgers. ( b ) Electron micrograph of a low-angle 
grain boundary in molybdenum. The three dislocations in the image each have the same Burgers 
vector as in the drawing in Fig. 11a. The white circles mark the positions of atomic columns 
normal to the plane of the paper. Each array of circles defines the position of a dislocation, with 
four circles on the top of each array and three circles below'. Closure failure is indicated by the 
arrows which define the Burgers vectors. (Courtesy of R. Gronsky.) 



Figure 12 Electron micrograph of dislocation structures in low-angle grain boundaries in 
an Al-7 percent Mg solid solution. Notice the lines of small dots on the right. Mag. X 17,000. 
(R. Goodrich and G. Thomas.) 


Figure 13 Dislocation etch pits in low-angle 
boundary on (100) face of germanium; the angle of 
the boundary is 27.5”. The boundary lies in a (Oil) 
plane; the line of the dislocations is [100]. The 
Burgers vector is the shortest lattice translation vec- 
tor, or \b\ = a!\fi = 4.0 A. (F. L. Vogel, Jr.) 



and co-workers. By counting etch pits along the intersection of a low-angle 
grain boundary with an etched germanium surface (Fig. 13), they determined 
the dislocation spacing D. They assumed that each etch pit marked the end of 
a dislocation. The angle of tilt calculated from the relation 6 = b/D agrees well 
with the angle measured directly by means of x-rays. 

The interpretation of low-angle boundaries as arrays of dislocations is fur- 
ther supported by the fact that pure tilt boundaries move normal to them- 
selves on application of a suitable stress. The motion has been demonstrated in 
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Figure 14 Motion of a low-angle grain boundary under stress. The boundary is the straight vertical 
line, and it is photographed under vertical illumination, thereby making evident the 2° angular 
change in the cleavage surface of the zinc crystal at the boundary. The irregular horizontal line is a 
small step in the cleavage surface which serves as a reference mark. The crystal is clamped at the 
left; at the right it is subject to a force normal to the plane of the page. Top, original position of 
boundary; bottom, moved back 0.4 mm. (J. Washburn and E. R. Parker.) 


a beautiful experiment, Fig. 14. The specimen is a bicrystal of zinc containing 
a 2° tilt boundary with dislocations about 30 atomic planes apart. One side of 
the crystal was clamped, and a force was applied at a point on the opposite side 
of the boundary. Motion of the boundary took place by cooperative motion of 
the dislocations in the array, each dislocation moving an equal distance in its 
own slip plane. The motion was produced by stresses of the order of magni- 
tude of the yield stress for zinc crystals, strong evidence that ordinary defor- 
mation results from the motion of dislocations. 

Grain boundaries and dislocations offer relatively little resistance to diffu- 
sion of atoms in comparison with diffusion in perfect crystals. A dislocation is 
an open passage for diffusion. Diffusion is greater in plastically deformed ma- 
terial than in annealed crystals. Diffusion along grain boundaries controls the 
rates of some precipitation reactions in solids: the precipitation of tin from 
lead-tin solutions at room temperature proceeds about 10 8 times faster than 
expected from diffusion in an ideal lattice. 
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Dislocation Densities 

The density of dislocations is the number of dislocation lines that inter- 
sect a unit area in the crystal. The density ranges from well below 10 2 
dislocations/cm 2 in the best germanium and silicon crystals to 10 u or 10 12 dis- 
locations/cm 2 in heavily deformed metal crystals. The methods available for 
estimating dislocation densities are compared in Table 2. The actual dislocation 
configurations in cast or annealed (slowly cooled) crystals correspond 
either to a group of low-angle grain boundaries or to a three-dimensional 
network of dislocations arranged in cells, as shown in Fig. 15. 

Lattice vacancies that precipitate along an existing edge dislocation will eat 
away a portion of the extra half-plane of atoms and cause the dislocation to climb, 
which means to move at right angles to the slip direction. If no dislocations are 


Table 2 Methods for estimating dislocation densities 8 


Technique 

Specimen thickness 

Width of 
image 

Maximum practical 
density, per cm 2 

Electron microscopy 

>1000 A 

-100 A 

10 n -10 12 

X-ray transmission 

0. 1-1.0 mm 

5 /urn 

10 4 -10 o 

X-ray reflection 

<2 jjl ni (min.) — 50 /am (max.) 

2 /u m 

10 6 -10 7 

Decoration 

~10 p, m (depth of focus) 

0.5 /am 

2 X 10 7 

Etch pits 

no limit 

0.5 /X m b 

4 X 10 s 


“W. G. Johnston. 

b Limit of resolution of etch pits. 



Figure 15 Cell structure of three-dimensional tangles of dislocations in deformed aluminum. 
(P. R. Swann.) 
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Figure 16 Electron micrograph of dislocation loops formed by aggregation and collapse of 
vacancies in Al-5 percent Mg quenched from 550°C. The helical dislocations are formed by 
vacancy condensation on a screw dislocation. Mag. X 43, 000. (A. Eikum and G. Thomas.) 


present, the crystal will become supersaturated with lattice vacancies; their pre- 
cipitation in cylindrical vacancy plates may be followed by collapse of the plates 
and formation of dislocation rings that grow with further vacancy precipitation, as 
in Fig. 16. 

Dislocation Multiplication and Slip 

Plastic deformation causes a very great increase in dislocation density, 
typically from 10 8 to about 10 u dislocations/cm 2 during deformation. If a dis- 
location moves completely across its slip plane, an offset of one atom spacing is 
produced, but offsets up to 100 to 1000 atom spacings are observed. This 
means that dislocations multiply during deformation. 

Consider a closed circular dislocation loop of radius r surrounding a 
slipped area having the radius of the loop. Such a loop will be partly edge, 
partly screw, and mostly of intermediate character. The strain energy of the 
loop increases as its circumference, so that the loop will tend to shrink in size. 
However, the loop will tend to expand if a shear stress is acting that favors slip. 
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Figure 17 Frank-Read mechanism 
for multiplication of dislocations, 
showing successive stages in the gen- 
eration of a dislocation loop by the 
segment BC of a dislocation line. The 
process can be repeated indefinitely. 



Figure 18 A Frank-Read dislocation source in silicon, decorated with copper precipitates and 
viewed with infrared illumination. Two complete dislocation loops are visible, and the third, inner- 
most loop is near completion. (After W. C. Dash.) 


A common feature of all dislocation sources is the bowing of dislocations. 
A dislocation segment pinned at each end is called a Frank-Read source, and 
it can lead (Fig. 17) to the generation of a large number of concentric disloca- 
tions on a single slip plane (Fig. 18). Related types of dislocation multiplica- 
tion mechanisms account for slip and for the increased density of dislocations 
during plastic deformation. Double cross-slip is the most common source. 
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STRENGTH OF ALLOYS 

Pure crystals are very plastic and yield at veiy low stresses. There appear 
to be four important ways of increasing the yield strength of an alloy so that it 
will withstand shear stresses as high as 1 0 2 G. They are mechanical blocking 
of dislocation motion, pinning of dislocations by solute atoms, impeding dis- 
location motion by short-range order, and increasing the dislocation density so 
that tangling of dislocations results. All four strengthening mechanisms de- 
pend for their success upon impeding dislocation motion. A fifth mechanism, 
that of removing all dislocations from the crystal, may operate for certain fine 
hairlike crystals (whiskers) that are discussed in the section on crystal growth. 

Mechanical blocking of dislocation motion can be produced most directly 
by introducing tiny particles of a second phase into a crystal lattice. This 
process is followed in the hardening of steel, where particles of iron car- 
bide are precipitated into iron, and in hardening aluminum, where particles 
of Al 2 Cu are precipitated. The pinning of a dislocation by particles is shown 
in Fig. 19. 

In strengthening by the addition of small particles there are two cases to 
be considered: either the particle can be deformed with the matrix, which re- 
quires that the particle can be traversed by the dislocation, or the particle 
cannot be traversed by the dislocation. If the particle cannot be cut, the stress 



Figure 19 Dislocations pinned by particles in magnesium oxide. (Electron micrograph by 
G. Thomas and J. Washburn.) 
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necessary to force a dislocation between particles spaced L apart on a slip 
plane should be approximately 

a/G — b/L . (10) 

The smaller the spacing L, the higher is the yield stress u . Before particles pre- 
cipitate, L is large and the strength is low. Immediately after precipitation is 
complete and many small particles are present, L is a minimum and the strength 
is a maximum. If the alloy is then held at a high temperature, some particles 
grow at the expense of others, so that L increases and the strength drops. Hard 
intermetallic phases, such as refractory oxides, cannot be cut by dislocations. 

The strength of dilute solid solutions is believed to result from the pinning 
of dislocations by solute atoms. The solubility of a foreign atom will be greater 
in the neighborhood of a dislocation than elsewhere in a crystal. An atom that 
tends to expand the crystal will dissolve preferentially in the expanded region 
near an edge dislocation. A small atom wall tend to dissolve preferentially in 
the contracted region near the dislocation — a dislocation offers both expanded 
and contracted regions. 

As a result of the affinity of solute atoms for dislocations, each dislocation 
will collect a cloud of associated solute atoms during cooling, at a time when 
the mobility of solute atoms is high. At still lower temperatures, diffusion of 
solute atoms effectively ceases, and the solute atom cloud becomes fixed in the 
crystal. When a dislocation moves, leaving its solute cloud behind, the energy 
of the crystal must increase. The increase in energy can only be provided by an 
increased stress acting on the dislocation as it pulls away from the solute atom 
cloud, and so the presence of the cloud strengthens the crystal. 

The passage of a dislocation across a slip plane in pure crystals does not 
alter the binding energy across the plane after the dislocation is gone. The in- 
ternal energy of the crystal remains unaffected. The same is true for random 
solid solutions, because the solution is equally random across a slip plane after 
slip. Most solid solutions, however, have short-range order. Atoms of different 
species are not arranged at random on the lattice sites, but tend to have an ex- 
cess or a deficiency of pairs of unlike atoms. Thus in ordered alloys disloca- 
tions tend to move in pairs: the second dislocation reorders the local disorder 
left by the first dislocation. 

The strength of a crystalline material increases with plastic deformation. 
The phenomenon is called work-hardening or strain-hardening. The 
strength is believed to increase because of the increased density' of disloca- 
tions and the greater difficulty' of moving a given dislocation across a slip plane 
that is threaded by many dislocations. Strain-hardening frequently is em- 
ployed in the strengthening of materials, but its usefulness is limited to low' 
enough temperatures so that annealing does not occur. 

An important factor in strain-hardening is the total density of dislocations. 
In most metals dislocations tend to form cells (Fig. 15) of disloeation-free 
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areas of dimensions of the order of 1 /am. But unless we can get a uniform high 
density of dislocations we cannot strain-harden a metal to its theoretical 
strength, because of slip in the dislocation-free areas. A high total density is 
accomplished by explosive deformation or by special thermal-mechanical 
treatments, as of martensite in steel. 

Each of the mechanisms of strengthening crystals can raise the yield 
strength to the range of 10 -3 G to 10" 2 G. All mechanisms begin to break 
down at temperatures where diffusion can occur at an appreciable rate. When 
diffusion is rapid, precipitated particles dissolve; solute clouds drift along with 
dislocations as they glide; short-range order repairs itself behind slowly mov- 
ing dislocations; and dislocation climb and annealing tend to decrease the dis- 
location density. The resulting time-dependent deformation is called creep. 
This irreversible motion precedes the elastic limit. The search for alloys for 
use at very high temperatures is a search for reduced diffusion rates, so that 
the four strengthening mechanisms will survive to high temperatures. But the 
central problem of strong alloys is not strength, but ductility, for failure is 

often bv fracture. 
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DISLOCATIONS AND CRYSTAL GROWTH 

In some cases the presence of dislocations may he the controlling factor in 
crystal growth. When crystals are grown in conditions of low supersaturation, 
of the order of 1 percent, it has been observed that the growth rate is enor- 
mously faster than that calculated for an ideal crystal. The actual growth rate 
is explained in terms of the effect of dislocations on growth. 

The theory of growth of ideal crystals predicts that in crystal growth from 
vapor, a supersaturation (pressure/equilibrium vapor pressure) of the order of 
10 is required to nucleate new crystals, of the order of 5 to form liquid drops, 
and of 1.5 to form a two-dimensional monolayer of molecules on the face of a 
perfect crystal. Volmer and Schultze observed growth of iodine crystals at 
vapor supersaturations down to less than 1 percent, where the growth rate 
should have been down by the factor exp( — 3000) from the rate defined as the 
minimum observable growth. 

The large disagreement expresses the difficulty of nucleating a new mono- 
layer on a completed surface of an ideal crystal. But if a screw dislocation is 
present (Fig. 20), it is never necessary to nucleate a new layer: the crystal will 
grow in spiral fashion at the edge of the discontinuity shown. An atom can be 
bound to a step more strongly than to a plane. The calculated growth rates for 
this mechanism are in good agreement with observation. We expect that nearly 
all crystals in nature grown at low' supersaturation will contain dislocations, as 
otherwise they could not have grown. Spiral growth patterns have been ob- 
served on a large number of crystals. A beautiful example of the growth pat- 
tern from a single screw dislocation is given in Fig. 21. 
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Figure 20 Development of a spiral step produced by intersection of a screw dislocation with the 
surface of a crystal as in Fig. 8. (F. C. Frank.) 



Figure 21 Phase contrast micrograph of a hexagonal spiral growth pattern on a SiC crystal. The 
step height is 165 A. (A. R. Verma.) 


If the growth rate is independent of direction of the edge in the plane of 
the surface, the growth pattern is an Archimedes spiral, r = a6, where a is a 
constant. The limiting minimum radius of curvature near the dislocation is 
determined by the supersaturation. If the radius of curvature is too small, 
atoms on the curved edge evaporate until the equilibrium curvature is at- 
tained. Away from the origin each part of the step acquires new atoms at a con- 
stant rate, so that dr/dt = const. 

Whiskers 

Fine hairlike crystals, or whiskers, have been observed to grow under con- 
ditions of high supersaturation without the necessity for more than perhaps one 
dislocation. It may be that these crystals contain a single axial screw dislocation 
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Figure 22 A nickel whisker of diameter 1000 A bent in a loop. (R. W. De Blois.) 


that aids their essentially one-dimensional growth. From the absence of dis- 
locations we would expect these crystal whiskers to have high yield strengths, of 
the order of the calculated value G/30 discussed earlier in this chapter. A single 
axial screw dislocation, if present, could not cause yielding, because in bending 
the crystal the dislocation is not subjected to a shear stress parallel to its 
Burgers vector. That is, the stress is not in a direction that can cause slip. 
Herring and Galt observed whiskers of tin of radius ~1CT 4 cm with elastic 
properties near those expected from theoretically perfect crystals. They ob- 
served yield strains of the order of 10~ 2 , which correspond to shear stresses of 
order 10 -2 C, about 1000 times greater than in bulk tin, confirming the early 
estimates of the strength of perfect crystals. Theoretical or ideal elastic proper- 
ties have been observed for a number of materials as for carbon nanotubes. A 
single domain whisker of nickel is shown in Fig. 22. 


HARDNESS OF MATERIALS 


The hardness of materials is measured in several way's, the simplest test 
for nonmetals being the scratch test. Substance A is harder than substance B if 
A will scratch B but B will not scratch A. A standard scale is used for represen- 
tative minerals, with diamond, the hardest, assigned the value 10 and talc, the 
softest, assigned the value 1: 


10 diamond C 
9 corundum A1 2 0 3 
8 topaz Al 2 Si0 4 F 2 
7 quartz Si0 2 
6 orthoclase KAlSi 3 0 g 


5 apatite Cas^O^F 
4 fluorite CaF 2 
3 calcite CaC0 3 
2 gypsum CaS0 4 ■ 2H a O 
1 talc 3MgO • 4SiO a ■ H 2 0 


There is great current interest in the development of materials of great 
hardness, for example as films for use as scratch-resistant coatings on lenses. 
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It is widely felt that the scale between diamond and corundum is misleading, 
because diamond is much, much harder than corundum. It has been suggested 
that one might assign diamond the hardness 15, with the gap between 9 and 15 
to be filled in eventually by synthetic materials, such as compounds of C and B. 

Modem scales of hardness, such as the VHN scale, are based on indcntcr 
tests in which an indenter is pressed into the surface of the material and the 
size of the impression is measured. The Vickers Hardness Numbers of se- 
lected materials are tabulated below, after conversion by E. R. Weber to units 
of CPa [GN/m 2 ]: 


Diamond 

45.3 

BeO 

7.01 

SiC 

20.0 

Steel (quenched) 

4.59 

Si 3 N 4 

18.5 

Cu (annealed) 

0.25 

ai 2 o 3 

14.0 

Al (annealed) 

0.12 

B 

13.5 

Pb 

0.032 

WC 

11.3 




The data are from J. C. Anderson and others. 

Problems 

1. Lines of closest packing. Show that the lines of closest atomic packing are (110) in 
fee structures and (111) in bcc structures. 

2. Dislocation pairs, (a) Find a pair of dislocations equivalent to a row of lattice 
vacancies; (b) find a pair of dislocations equivalent to a row of interstitial atoms. 

3. Force on dislocation. Consider a crystal in the form of a cube of side L containing 
an edge dislocation of Burgers vector b. If the crystal is subjected to a shear stress <7 
on the upper and lower faces in the direction of slip, show, by considering energy 
balance, that the force acting on the dislocation is F = hn per unit length. 
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Figure 1 Ordered (a) and disordered (b) arrangements of AB ions in the alloy AB. 
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CHAPTER 22: ALLOYS 


GENERAL CONSIDERATIONS 

The theory of the band structure of solids assumes that the crystal has 
translational invariance. But suppose that the crystal is composed of two 
elements A and B that occupy at random the regular lattice sites of the struc- 
ture, in proportions x and 1 — x for the composition A X B L - X . The translational 
symmetry is no longer perfect. Will we then lose the consequences of band 
theory 7 , such as the existence of Fermi surfaces and of energy gaps? Will insula- 
tors become conductors because the energy gap is gone? We touched on these 
questions in the discussion of amorphous semiconductors in Chapter 19. 

Experiment and theory' agree that the consequences of the destruction of 
perfect translational symmetry are much less serious (nearly always) than we ex- 
pect at first sight. The viewpoint of the effective screened potential of Chapter 9 
is helpful in these matters, first because the effective potentials are relatively 
w'eak in comparison with a free ion potential and, second and most important, 
the differences between the effective potentials of the host and the additive 
atoms may be very weak in comparison with either alone. Alloys of Si and Ge or 
of Cu and Ag are classic examples of what we may call the relative ineffective- 
ness of alloying. 

In any event, a low concentration of impurity atoms cannot have much 
effect on the Fourier components U G of the effective potential I7(r) that is re- 
sponsible for the band gaps and for the form of the Fermi surface. (This state- 
ment implies that the Gs exist, which implies that a regular lattice exists. This is 
not an important assumption because we know that thermal phonons do not 
have drastic effects on the band structure, so that lattice distortions described 
as frozen-in pbonons should not have drastic effects. Tf the distortions are more 
serious, as with amorphous solids, the electronic changes can be significant.) 

It is true that an impurity atom will introduce Fourier components of C7(r) 
at wavevectors that are not reciprocal lattice vectors, but at low impurity con- 
centration such components are never large in comparison with the U c , arguing 
from the statistics of random potentials. The Fourier components at the recip- 
rocal lattice vectors G will still he large and will give the band gaps, Fermi sur- 
faces, and sharp x-ray diffraction lines characteristic of a regular lattice. 

The consequences of alloying will be particularly small when the impurity 
element belongs to the same column of the periodic table as the host element it 
replaces, because the atomic cores will make rather similar contributions to the 
effective potentials. 

One measure of the effect of alloying is the residual electrical resistivity, de- 
fined as the low temperature limit of the resistivity. Here we must distinguish 
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Figure 2 Resistivity of a disordered binary alloy of copper and gold. The variation of the residual 
resistivity depends on the composition Cu-Au,-. as x(l — x), which is known as Nordheim's Rule 
for a disordered alloy. Here x(l — x) is a measure of the degree of maximum disorder possible for 
a given value of x. (Johansson and Linde.) 


between disordered and ordered alloys. An alloy is disordered if the A and B 
atoms are randomly arranged, which occurs for a general value of x in the com- 
position A x Bi- x . For special values of x, such as 1/4, 1/2, and 3/4 for a cubic 
structure, it is possible for ordered phases to form, phases in which the A and B 
atoms form an ordered array. The distinction between order and disorder is 
shown in Fig. 1. The effect of order on the electrical resistivity is shown in 
Figs. 2 and 3. The residual resistivity increases with disorder, as discussed for 
amorphous materials in Chapter 19. The effect is shown in Fig. 2 for the Cu-Au 
alloy system. When the specimen is cooled slowly from a high temperature, or- 
dered structures are formed at Cu s Au and CuAu; these structures have a lower 
residual resistivity by virtue of their order, as in Fig. 3. 

Thus we can use the residual electrical resistivity to measure the effect 
of alloying in a disordered structure. One atomic percent of copper dissolved 
in silver (which lies in the same column of the periodic table) increases the 
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Cu Atomic percent An 


Figure 3 Effect of ordered phases on the resistivity of a binary alloy Cu.Aii, The alloys 
here have been annealed, whereas those in Fig. 2 have been quenched (cooled rapidly). The 
compositions of low residual resistivity correspond to the ordered compositions Cii 3 Au and CuAu, 
(Johansson and Linde.) 


residual resistivity by 0.077 /Ltohin-cm. This corresponds to a geometrical 
scattering cross section which is only 3 percent of the naive “projected area” of 
the impurity atom, so that the scattering effect is very' small. 

In insulators there is no experimental evidence for a significant reduction 
of band gap caused by the random potential components. For example, silicon 
and germanium form homogeneous solid solutions, known as substitutional 
alloys, over the entire composition range, but the band edge energies vary con- 
tinuously with composition from the pure Si gap to the pure Gc gap. 

It is widely' believed, however, that the density of states near the band 
edges in amorphous materials is smeared by the gross absence of translational 
symmetry. Some of the new states thus formed just inside the gap may not 
necessarily be current-carrying states because they may' not extend throughout 
the crystal. 
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SUBSTITUTIONAL SOLID SOLUTIONS— HUME-ROTHERY RULES 

We now discuss substitutional solid solutions of one metal A in another 
metal B of different valence, where A and B occupy, at random, equivalent 
sites in the structure. Huine-Rothery treated the empirical requirements for 
the stability of a solid solution of A and B as a single phase system. 

One requirement is that the atomic diameters be compatible, which 
means that they should not differ by more than IS percent. For example, the 
diameters are favorable in the Cu (2.55 A) — Zn (2.65 A) alloy system: zinc 
dissolves in copper as an fee solid solution up to 38 atomic percent zinc. The 
diameters are less favorable in the Cu (2.55 A) — Cd (2.97 A) system, where 
only 1.7 atomic percent cadmium is soluble in copper. The atomic diameters 
referred to copper are 1.04 for zinc and 1.165 for cadmium. 

Although the atomic diameters may be favorable, solid solutions will not 
form when there is a strong chemical tendency for A and B to form “intermetal- 
lic compounds,” which are compounds of definite chemical proportions. If A is 
strongly electronegative and B strongly electropositive, compounds such as AB 
and A 2 B may precipitate from the solid solution. (This is different from the for- 
mation of an ordered alloy phase only by the greater chemical bonding strength 
of the intermetallic compounds.) Although the atomic diameter ratio is favor- 
able for As in Cu (1.02), only 6 atomic percent As is soluble. The diameter ratio 
is also favorable for Sb in Mg (1.06), yet the solubility of Sb in Mg is very small. 

The electronic structure of alloys can often he described by the average 
number of conduction electrons (or valence electrons) per atom, denoted by n. 
In the alloy CuZn the value of n is 1.50; in CuAl, n = 2.00. Changes in electron 
concentration determine structural changes in many alloy systems. 

The phase diagram of the copper-zinc system 1 is shown in Fig. 4. The fee 
structure of pure copper (n = 1) persists on the addition of zinc (n = 2) until 
the electron concentration reaches 1.38. A bcc structure occurs at a minimum 
electron concentration of about 1.48. The y phase exists for the approximate 
range of n between 1.58 and 1.66, and the hep phase e occurs near 1.75. 

The term electron compound denotes an intermediate phase (such as 
the (3 phase of CuZn) whose crystal structure is determined by a fairly well de- 
fined electron to atom ratio. For many alloys the ratio is close to the Hume- 
Rothery rules: 1.50 for the j8 phase, 1.62 for the y phase, and 1.75 for the 
e phase. Representative experimental values are collected in Table 1, based on 
the usual chemical valence of 1 for Cu and Ag; 2 for Zn and Cd; 3 for AI and 
Ga; 4 for Si, Ge, and Sn. 

The Hume-Rothery rules find a simple expression in terms of the band 
theory of nearly free electrons. The observed limit of the fee phase occurs 

x The phases of interest are usually denoted by metallurgists by Creek characters: in the 
Cu-Zn system we have a (fee), /3 (bcc), y (complex cubic cell of 52 atoms), e (hep) and 17 (hep); f 
and t) differ considerably in c.!a ratio. The meaning of the characters depends on the alloy system. 
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Figure 4 Equilibrium diagram of phases in the copper-zinc alloy system. The a phase is fee; 
8 and 8 are bee: y is a complex structure; e and tj are both hep, but e has a c/a ratio near 1.56 
and r; (for pure Zn) has c/a = 1.86. The 0' phase is ordered bee, by which we mean that most of 
the Cu atoms occupy sites on one sc sublattice and most of the Zn atoms occupy sites on a second 
sc sublattice that interpenetrates the first sublattice. The )3 phase is disordered bcc; any site is 
equally likely to be occupied by a Cu or Zn atom, almost irrespective of what atoms arc in the 
neighboring sites. 



Table 1 Electron/atom ratios of electron compounds 


Alloy 

fee phase 
boundary 

Minimum 
bcc phase 
boundary 

■y-phase 

boundaries 

hep phase 
boundaries 

Cu-Zn 

1.38 

1.48 

1.58-1.66 

1.78-1.87 

Cu-Al 

1.41 

1.48 

1.63-1.77 


Cu-Ga 

1.41 




Cu-Si 

1.42 

1.49 



Cu-Ce 

1.36 




Cu-Sn 

1.27 

1.49 

1.60-1.63 

1.73-1.75 

Ag-Zn 

1.38 


1.58-1.63 

1.67-1.90 

Ag-Cd 

1.42 

1.50 

1.59-1.63 

1.65-1.82 

Ag-Al 

1.41 



1.55-1.80 
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close to the electron concentration of 1.36 at which an inscribed Fermi sphere 
makes contact with the Brillouin zone boundary for the fee lattice. The ob- 
served electron concentration of the bcc phase is close to the concentration 
1.48 at which an inscribed Fermi sphere makes contact with the zone bound- 
ary for the bcc lattice. Contact of the Fermi sphere with the zone boundary for 
the y phase is at the concentration 1.54. Contact for the hep phase is at the 
concentration 1.69 for the ideal da ratio. 

Why is there a connection between the electron concentrations at which a 
new phase appears and at which the Fermi surface makes contact with the 
boundary of the Brillouin zone? We recall that the energy bands split into two 
at the region of contact on the zone boundary (Chapter 9). If we add more 
electrons to the alloy at this stage, they will have to be accommodated in the 
upper band or in states of high energy near the zone comers of the lower 
band. Both options are possible, and both involve an increase of energy. It may 
also be energetically favorable for the crystal structure to change to one which 
can contain a Fermi surface of larger volume (more electrons) before contact 
is made with the zone boundary. In this way H. Jones made plausible the se- 
quence of structures fee, bcc, y, hep with increasing electron concentration. 

Measurements of the lattice parameter of Li-Mg alloys are shown in 
Fig. 5. In the range shown the structure is bcc. The lattice contracts during the 
initial stages of the addition of Mg to Li. When the lithium content drops 
below 50 atomic percent, corresponding to an average electron concentration 
increasing above 1.5 per atom, the lattice starts to expand. We have seen that 



Figure 5 Lattice parameter of body-centered cubic magnesium-lithium alloys. (After 
D. W. Levinson.) 
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Energy in eV 


Figure 6 Number of orbitals per unit energy range 
for the first Brillouin zone of the fee and bcc lat- 
tices, as a function of energy. 


for a spherical Fermi surface, contact with the zone boundary is established at 
n = 1.48 electrons per atom, in a bcc lattice. It appears that the expansion of 
the lattice arises from the onset of overlap across the zone boundary. 

The transformation from fee to bcc is illustrated by Fig. 6; this shows the 
number of orbitals per unit energy range as a function of energy, for the fee 
and bcc structures. As the number of electrons is increased, a point is reached 
where it is easier to accommodate additional electrons in the Brillouin zone of 
the bcc lattice rather than in the Brillouin zone of the fee lattice. The figure is 
drawn for copper. 


ORDER-DISORDER TRANSFORMATION 

The dashed horizontal line in the beta-phase (bee) region of the phase dia- 
gram (Fig. 4) of the Cu-Zn system represents the transition temperature be- 
tween the ordered (low temperature) and disordered (high temperature) 
states of the alloy. In the common ordered arrangement of an AB alloy with a 
bcc structure, all the nearest-neighbor atoms of a B atom are A atoms, and vice 
versa. This arrangement results when the dominant interaction among the 
atoms is an attraction between A and B atoms. (If the AB interaction is weakly 
attractive or repulsive, a two-phase system is formed in which some crystallites 
are largely A and other crystallites are largely B.) 

The alloy is completely ordered in equilibrium at absolute zero. It becomes 
less ordered as the temperature is increased, until a transition temperature is 
reached above which the structure is disordered. The transition temperature 
marks the disappearance of long-range order, which is order over many inter- 
atomic distances, but some short-range order or correlation among near 
neighbors may persist above the transition. The long-range order in an AB alloy 
is shown in Fig. 7a. Long- and short-range order for an alloy of composition 
AB :i is given in Fig. 7b. The degree of order is defined below. 
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Figure 7 (a) I .ong-rartge order versus temperature for an AB alloy. The transformation is second 
order, (b) Long-range and short-range order for an AB 3 alloy. The transformation for this composi- 
tion is first order. 


If an alloy is cooled rapidly from high temperatures to a temperature 
below the transition, a metastable condition may be produced in which a non- 
equilibrium disorder is frozen in the structure. The reverse effect occurs when 
an ordered specimen is disordered at constant temperature by heavy irradia- 
tion with nuclear particles. The degree of order may he investigated experi- 
mentally by x-ray diffraction. The disordered structure in Fig. 8 has diffraction 
lines at the same positions as if the lattice points were all occupied by only one 
type of atom, because the effective scattering power of each plane is equal to 
the average of the A and B scattering powers. The ordered structure has extra 
diffraction lines not possessed by the disordered structure. The extra lines are 
called superstructure lines. 

The use of the terms order and disorder in this chapter always refers to 
regular lattice sites; it is the occupancy that is randomly A or B. Do not con- 
fuse this usage with that of Chapter 19 on noncrystalline solids where there 
are no regular lattice sites and the structure itself is random. Both possibilities 
occur in nature. 
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(b) 


Figure 8 X-ray powder photographs in AuCu 3 alloy, (a) Disordered by quenching from T > T c ; 
(b) ordered by annealing at T < T c . (Courtesy of G. M. Gordon.) 


The structure of the ordered CuZn alloy is the cesium chloride structure 
of Chapter 1. The space lattice is simple cubic, and the basis has one Cu atom 
at 000 and one Zn atom at |§§. The diffraction structure factor 

S(hkl)=f Cu +fzne- M+,) . ( 1 ) 

This cannot vanish because / Cu / Zn ; therefore all reflections of the simple 
cubic space lattice will occur. In the disordered structure the situation is 
different: the basis is equally likely to have either Zn or Cu at 000 and either 
Zn or Cu at |||. Then the average structure factor is 

(S(hkl))=(f) + (f)e- Mh+k+,) , (2) 

where (/} = |(/ Cu +/zJ- Equation (2) is exactly the form of the result for the 
bcc lattice; the reflections vanish when h + k + l is odd. We see that the or- 
dered lattice has reflections (the superstructure lines) not present in the disor- 
dered lattice (Fig. 8). 

Elementary Theory of Order 

We give a simple statistical treatment of the dependence of order on tem- 
perature for an AB alloy with a bcc structure. The case A 3 B differs from AB, 
the former having a first-order transition marked by a latent heat and the latter 
having a second-order transition marked by a discontinuity in the heat capacity 
(Fig. 9). We introduce a measure of the long-range order. We call one simple 
cubic lattice a and the other b: the bcc structure is composed of the two inter- 
penetrating sc lattices, and the nearest neighbors of an atom on one lattice lie 
on the other lattice. If there are N atoms A and N atoms B in the alloy, the 
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Figure 9 Heat capacity versus tem- 
perature of CuZn alloy (/J-brass). 



long-range order parameter P is defined so that the number of As on the 
lattice a is equal to |(1 + P)N. The number of As on lattice b is equal to 
|(1 —P)N. When P = ±1, the order is perfect and each lattice contains only 
one type of atom. When P = 0, each lattice contains equal numbers of A and B 
atoms and there is no long-range order. 

We consider that part of the internal energy associated with the bond en- 
ergies of AA, AB, and BB nearest-neighbor pairs. The total bond energy is 

■E = N AA U AA "f N BB U BB +N ab U ab , (3) 

where is the number of nearest-neighbor ij bonds and U lf is the energy of 
an ij bond. 

The probability that an atom A on lattice a will have an AA bond is equal 
to the probability that an A occupies a particular nearest-neighbor site on b, 
times the number of nearest-neighbor sites, which is 8 for the bcc structure. 
We assume that the probabilities are independent. Thus, by the preceding ex- 
pressions for the number of As on a and b, 


K*a = 8[|(1 + P)N][ f<l - P)] = 2(1 - P 2 )N ; 
N bb = 8[|(1 + P)2V][|(1 - P)] = 2(1 - P 2 )N ; 
N ab = 8N[|(1 + P)] 2 + 8N[i(l -P)] 2 = 4(1 + P 2 )N . 


( 4 ) 
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The energy' (3) becomes 

E = E 0 + 2NP 2 U , 

(5) 

where 



F<> = 2i\'(t/ AA + 

+ 2U ab ) ; U = 2U ab -U aa ~U sb . 

(6) 


We now calculate the entropy of this distribution of atoms. There are 
|(1 + P)N atoms A and |(1 — P)N atoms B on lattice «; there are |(1 — P)N 
atoms A and §(1 + P)N atoms B on lattice b. The number of arrangements G of 
these atoms is 

G = 

From the definition of the entropy as S = k B In G, we have, using Stirling’s 
approximation, 

S = 2 Nk B In 2 - Nk B [(l + P) In(l + P) + (1 - P) In (1 - P) ] . (8) 

This defines the entropy of mixing. For P = ± 1, S = 0; for P = 0, S = 2Nk B In 2. 

The equilibrium order is determined by the requirement that the free 
energy’ F = E — TS be a minimum with respect to the order parameter P. On 
differentiating F with respect to P, we have as the condition for the minimum 

4NPU + Nk B T In = 0 . (9) 

The transcendental equation for P may be solved graphically; we find the 
smoothly decreasing curve shown in Fig. 7a. Near the transition we may 
expand (9) to find 4NPU + 2 Nk n TP = 0. At the transition temperature P = 0, 
so that 

T c — —2U/k B . (10) 

For a transition to occur, the effective interaction U must be negative. 

The short-range order parameter r is a measure of the fraction of the 
average number q of nearest-neighbor bonds that are AB bonds. When com- 
pletely disordered, an AB alloy has an average of four AB bonds about each 
atom A. The total possible is eight. We may define 

r=\(q~ 4), (11) 

so that r = 1 in complete order and r — 0 in complete disorder. Observe that r 
is a measure only of the local order about an atom, whereas the long-range 
order parameter P refers to the purity' of the entire population on a given sub- 
lattice. Above the transition temperature T c the long-range order is rigorously 
zero, but the short-range order is not. 
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PIIASE DIAGRAMS 

There is a large amount of information in a phase diagram even for a bi- 
nary system, as in Fig. 4. The areas enclosed by curves relate to the equilib- 
rium state in that region of composition and temperature. The curves mark the 
course of phase transitions as plotted in the T-x plane, where x is the composi- 
tion parameter. 

The equilibrium state is the state of minimum free energy of the binary 
system at given T, x. Thus the analysis of a phase diagram is the subject of 
thermodynamics. Several extraordinary results come out of this analysis, in 
particular the existence of low-melting-point eutectic compositions. Because 
the analysis has been treated in Chapter 11 of TP, we only outline the principal 
results here. 

Two substances will dissolve in each other and form a homogeneous mix- 
ture il that is the configuration of lowest free energy accessible to the compo- 
nents. The substances will form a heterogeneous mixture if the combined free 
energy of the two separate phases side by side is lower than the free energy of 
the homogeneous mixture. Now we say that the mixture exhibits a solubility 
gap. In Fig. 4 we see that compositions near Cu 0 S0 Zn 0 40 are in a solubility gap 
and are mixtures of fee and bcc phases of different structures and composi- 
tions. The phase diagram represents the temperature dependence of the solu- 
bility gaps. 

When a small fraction of a homogeneous liquid freezes, the composition of 
the solid that forms is almost always different from that of the liquid. Consider 
a horizontal section near the composition Cu 0 ^Ziiy 20 in Fig. 4. Let x denote the 
weight percent of zinc. At a given temperature, there are three regions: 

x > x L , the equilibrium system is a homogeneous liquid. 

x s < x < x L , there is a solid phase of composition x s and a liquid phase 
of composition x L . 

x < x s , equilibrium system is a homogeneous solid. 

The point x L traces a curve called the liquidus curve, and the point x s traces 
tl»e solidus curve. 

Eutectics. Mixtures with two liquidus branches in their phase diagram are 
called eutectics, as in Fig. 10 for the Au-Si system. The minimum solidification 
temperature is called the eutectic temperature; here the composition is the 
eutectic composition. The solid at this composition consists of two separate 
phases, as in the microphotograph of Fig. 11. 

There are many binary systems in which the liquid phase persists to tem- 
peratures below the lower melting temperature of the constituents. Thus 
Auo fi9 Si 0 31 solidifies at 370°C as a two-phase heterogeneous mixture, although 
Au and Si solidify at 1063°C and 1404°C, respectively. One phase of the eutec- 
tic is nearly pure gold; the other nearly pure silicon. 
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Pure Au Atomic percent silicon Pure Si 


Figure 10 Eutectic phase diagram of gold-silicon alloys. The eutectic consists of two 
branches that come together at T c = 370 °C and x„ = 0.31 atomic percent Si. (After Kittel and 
Kroemer, TP.) 



10 ,u.m 

Figure 1 1 Microphotograph of the Pb-Sn eutectic. (Courtesy of J. D. Hunt and K. A. Jackson.) 
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The Au-Si eutectic is important in semiconductor technology because the 
eutectic permits low temperature welding of gold contact wires to silicon de- 
vices. Lead-tin alloys have a similar eutectic of Pbo. 26 Sno .74 at 183°C. This or 
nearby compositions are used in solder: nearby if a range of melting tempera- 
tures is desired for ease in handling. 

TRANSITION METAL ALLOYS 

When we add copper to nickel, the effective magneton number per atom 
decreases linearly and goes through zero near Cu 0 60 Ni 0 40 , as shown in Fig. 12. 
At this composition the extra electron from the copper has filled the 3d band, 
or the spin-up and spin-down 3d sub-bands that were shown in Fig. 12.7b. The 
situation is shown schematically in Fig. 13. 


Figure 12 Bohr magneton numbers 
of nickel-copper alloys. 



Figure 13 Distribution of electrons in the alloy 
60Cu40Ni. The extra 0.6 electron provided by 
the copper has filled the d band entirely and in- 
creased slightly the number of electrons in the s 
band with respect to Fig. 12.7b. 
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Energy relative to Fermi energy' (eV) 


Figure 14 Density of states in nickel. (V. L. Moruzzi, J. F. Janak. and A. R. Williams.) 


For simplicity the block drawings represent the density of states as uni- 
form in energy. The actual density is known to be far from uniform; the result 
of a modern calculation is shown in Fig. 14 for nickel. The width of the 3 d 
band is about 5 eV. At the top, where the magnetic effects are determined, the 
density of states is particularly high. The average density of states is an order 
of magnitude higher in the 3d band than in the 4s band. This enhanced density' 
of states ratio gives a rough indication of the expected enhancement of the 
electronic heat capacity' and of the paramagnetic susceptibility in the nonfer- 
romagnetic transition metals as compared with the simple monovalent metals. 

Figure 15 shows the effect of the addition of small amounts of other ele- 
ments to nickel. On the hand model an alloying metal with z valence electrons 
outside a filled d shell is expected to decrease the magnetization of nickel by 
approximately z Bohr magnetons per solute atom. This simple relation holds 
well for Sn, Al, Zn, and Cu, with z = 4, 3, 2, and 1, respectively. For Co, Fe, 
and Mn the localized moment model of Friedel accounts for effective z values 
of —1, —2, and —3, respectively. 

The average atomic magnetic moments of binary alloys of the elements 
in the iron group are plotted in Fig. 16 as a function of the concentration of 
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Figure 15 Saturation magnetization of nickel alloys in Bohr magnetons per atom as a function of 
the atomic percent of solute element. 


electrons outside the 3 p shell. This is called a Slater- Pauling plot. The main se- 
quence of alloys on the right-hand branch follows the rules discussed in con- 
nection with Fig. 15. As the electron concentration is decreased, a point is 
reached at which neither of the 3 d sub-bands is entirely filled, and the mag- 
netic moment then decreases toward the left-hand side of the plot. 

Electrical Conductivity. It might be thought that in the transition metals the 
availability of the 3d band as a path for conduction in parallel with the 4s band 
would increase the conductivity, but this is not the way it works out. The resistiv- 
ity' of the s electron path is increased by collisions with the d electrons; this is a 
powerful extra scattering mechanism not available when the d hand is filled. 
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Figure 16 Average atomic moments of binary alloys of the elements in the iron group. (Bozorth.) 


We compare the values of the electrical resistivities of Ni, Pd, and Pt in 
microhm-cm at 18°C with that of the noble metals Cu, Ag, and Au immedi- 
ately following them in the periodic table: 



1.7 1.6 2.2 


The resistivities of the noble metals are lower than those of the transition 
metals by a factor of the order of 5. This shows the effectiveness of the s-d 
scattering mechanism. 


KONDO EFFECT 


In dilute solid solutions of a magnetic ion in a nonmagnetic metal crystal 
(such as Mn in Cu), the exchange coupling between the ion and the conduc- 
tion electrons has important consequences. The conduction electron gas is 
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Figure 17 Magnetization of a free electron Fermi gas at T = 0 in neighborhood of a point mag- 
netic moment at the origin r = 0, according to the RKKY theory. The horizontal axis is 2i t r, where 
k F is the waveveetor at the Fermi surface, (de Gennes.) 


magnetized in the vicinity of the magnetic ion, with the spatial dependence 
shown in Fig. 17. This magnetization causes an indirect exchange interaction 2 
between two magnetic ions, because a second ion perceives the magnetization 
induced bv the first ion. The interaction, known as the Friedel or RKKY inter- 
action, also plays a role in the magnetic spin order of the rare-earth metals, 
where the spins of the 4 f ion cores are coupled together by the magnetization in- 
duced in the conduction electron gas. 

A consequence of the magnetic ion-conduction electron interaction is the 
Kondo effect, discussed in a different context in Chapter 18. A minimum in 
the electrical resistivity-temperature curve of dilute magnetic alloys at low 
temperatures has been observed in alloys of Cu, Ag, Au, Mg, Zn with Cr, Mn, 
and Fe as impurities, among others. 

The occurrence of a resistance minimum is connected with the existence 
of localized magnetic moments on the impurity atoms. Where a resistance 
minimum is found, there is inevitably a local moment. Kondo showed that the 
anomalously high scattering probability of magnetic ions at low temperatures 


2 A review of indirect exchange interactions in metals is given by C. Kittel, Solid state physics 
22, 1 (1968); a review of the Kondo effect is given by J. Kondo, “Theory of dilute magnetic alloys,” 
Solid state physics 23, 184 (1969) and A. J. Heeger, “Localized moments and nonmoments in 
metals; the Kondo effect,” Solid state physics 23, 248 (1969). The notation RKKY stands for 
Ruderman, Kittel, Kasuya, and Yosida. 
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Figure 18 A comparison of experimental and theoretical results for the increase of electrical re- 
sistivity at low temperatures in dilute alloys of iron in gold. The resistance minimum lies to the 
right of the figure, for the resistivity increases at high temperatures because of scattering of elec- 
trons by thermal phonons. The experiments are due to D. K. C. MacDonald, W. B. Pearson, and 
I. M. Templeton; the theory is by J. Kondo. An exact solution was given by K. Wilson. 


is a consequence of the dynamic nature of the scattering by the exchange 
coupling and of the sharpness of the Fermi surface at low temperatures. The 
temperature region in which the Kondo effect is important is shown in Fig. 18. 
The central result is that the spin-dependent contribution to the resistivity is 


Pspin CPM 


3 z/ i 

1 + -~lnT 
€ f 




epo — cpj In T 


( 12 ) 


where J is the exchange energy; z the number of nearest neighbors; c the con- 
centration; and p M is a measure of the strength of the exchange scattering. 
We see that the spin resistivity increases toward low temperatures if ] is nega- 
tive. If the phonon contribution to the electrical resistivity goes as T 5 in the 
region of interest and if the resistivities are additive, then the total resistivity 
has the form 


p — aT 5 + cp 0 — cpi In T , 


(13) 




with a minimum at 




dp/dT = 5«r — cpi/T = 0 , 

(14) 

whence 




T m m = (cpi/5a) us . 

(15) 


The temperature at which the resistivity is a minimum varies as the one-fifth 
power of the concentration of the magnetic impurity, in agreement with experi- 
ment, at least for Fe in Cu. 


Problems 

1. Superlattice lines in Cu 3 Au. Cu 3 Au alloy (75% Cu, 25% Au) has an ordered state 
below 400°C, in which the gold atoms occupy the 000 positions and the copper 
atoms the ||0, |o|, and 0|^ positions in a face-centered cubic lattice. Give the in- 
dices of the new x-ray reflections that appear when the alloy goes from the disor- 
dered to the ordered state. List all new reflections with indices «2. 

2. Configurational heat capacity. Derive an expression in terms of P(T) for the heat 
capacity associated with order/disorder effects in an AB alloy. [The entropy (8) is 
called the configurational entropy or entropy of mixing ] 



APPENDIX A: TEMPERATURE DEPENDENCE 
OF THE REFLECTION LINES 


... I came to the conclusion that the sharpness 
of the interference lines mould not suffer but 
that their intensity should diminish with in- 
creasing angle of scattering, the more so the 
higher the temperature. 

P. Debye 

As the temperature of the crystal is increased, the intensity of the Bragg- 
reflected beams decreases, but the angular width of the reflected line does not 
change. Experimental intensities for aluminum are shown in Fig. 1. It is sur- 
prising that we can get a sharp x-ray reflection from atoms undergoing large 
amplitude random thermal motion, with instantaneous nearest-neighbor spac- 
ings differing by 10 percent at room temperature. Before the Laue experiment 



T, in K 


Figure 1 The dependence of intensity on temperature for the (hOO) x-ray reflections of 
aluminum. Reflections (7i00; with h odd are forbidden for an fee structure. (After R. M. Nicklow 
and R. A. Young.) 
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was done, but when the proposal was discussed 1 in a coffee house in Munich, 
the objection was made that the instantaneous positions of the atoms in a 
crystal at room temperature are far from a regular periodic array, because of 
the large thermal fluctuation. Therefore, the argument went, one should not 
expect a well-defined diffracted beam. 

But such a beam is found. The reason was given by Debye. Consider the 
radiation amplitude scattered by a crystal: let the position of the atom nomi- 
nally at Tj contain a term u(f) fluctuating in time: r(t) = r ; + u(t). We suppose 
each atom fluctuates independently about its own equilibrium position. 2 Then 
the thermal average of the structure factor (2.43) contains terms 

fj exp(— tG • ry){exp(— iG ■ u)) , (1) 

where { • • ■ ) denotes thermal average. The series expansion of the exponential is 

(exp { -iG • u)) = 1 — i{G • u) - ~((G • u) 2 ) H . (2) 

But <G • u) — 0, because u is a random thermal displacement uncorrelated 
with the direction of G. Further, 

{(G • u) 2 ) = G 2 (u 2 )( cos 2 e) ~ j(u 2 )G 2 . 

The factor f arises as the geometrical average of cos 2 0 over a sphere. 

The function 

exp(~l (u 2 ) G 2 ) = 1 - l(u 2 )G 2 + • • • (3) 

has the same series expansion as (2) for the first two terms shown here. For a 
harmonic oscillator all terms in the series (2) and (3) can be shown to be iden- 
tical. Then the scattered intensity, which is the square of the amplitude, is 

I = I 0 exp(~l(u 2 )G 2 ) , (4) 

where l n is the scattered intensity from the rigid lattice. The exponential factor 
is the Debye-Waller factor. 

Here (u 2 ) is the mean square displacement of an atom. The thermal aver- 
age potential energy (U) of a classical harmonic oscillator in three dimensions 
is lk B T, whence 


(u) = ±C<u 2 ) = i M(o 2 (u' 2 ) = \k B T , (5) 


1 P. P. Ewald, private communication. 

^his is the Einstein model of a solid; it is not a very good model at low temperatures, but it 
works well at high temperatures. 
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where C is the force constant, M is the mass of an atom, and w is the frequency of 
the oscillator. We have used the result u> 2 = C/M. Thus the scattered intensity is 

I(hkl) — I 0 exp( -kf/i'Cr/Mor) , (6) 

where hkl are the indices of the reciprocal lattice vector G. This classical result 
is a good approximation at high temperatures. 

For quantum oscillators { u 2 ) does not vanish even at T = 0; there is zero- 
point motion. On the independent harmonic oscillator model the zero-point 
energy is § ftw; this is the energy of a three-dimensional quantum harmonic 
oscillator in its ground state referred to the classical energy of the same oscilla- 
tor at rest. Half ol the oscillator energy is potential energy, so that in the 
ground state 


([/) = = jfico ; ( u 2 ) = 3fi/2Mo> , (7) 

whence, by (4), 

l(hkl) — I Q exp(-hG 2 /2Ma>) (8) 

at absolute zero. If G = 10 9 cm -1 , w — 10 14 s _1 , and M = 10 -2Z g, the argu- 
ment of the exponential is approximately 0.1, so that //Z u — 0.9. At absolute 
zero, 90 percent of the beam is elastically scattered and 10 percent is inelasti- 
cally scattered. 

We see from (6) and from Fig. 1 that the intensity of the diffracted line 
decreases, but not catastrophically, as the temperature is increased. Reflec- 
tions of low G are affected less than reflections of high G. The intensity we 
have calculated is that of the coherent diffraction (or the elastic scattering) in 
the well-defined Bragg directions. The intensity lost from these directions is 
the inelastic scattering and appears as a diffuse background. In inelastic scat- 
tering the x-ray photon causes the excitation or de-excitation of a lattice vibra- 
tion, and the photon changes direction and energy. 

At a given temperature the Debye-Waller factor of a diffraction line de- 
creases with an increase in the magnitude of the reciprocal lattice vector G as- 
sociated with the reflection. The larger IGi is, the weaker the reflection at high 
temperatures. The theory we have worked out here for x-ray reflection applies 
equally well to neutron diffraction and to the Mossbauer effect, the recoil- 
less emission of gamma rays by nuclei bound in crystals. 

X-rays can be absorbed in a crystal also by the inelastic processes of photo- 
ionization of electrons and Compton scattering. In the photoeffect the x-ray 
photon is absorbed and an electron is ejected from an atom. In the Compton 
effect the photon is scattered inelastically by an electron: the photon loses 
energy and the electron is ejected from an atom. The depth of penetration 
of the x-ray beam depends on the solid and on the photon energy, but 1 cm is 
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typical. A diffracted beam in Bragg reflection may remove the energy in a 
much shorter distance, perhaps 10 3 cm in an ideal crystal. 


APPENDIX B; EWALD CALCULATION OF LATTICE SUMS 


The problem is to calculate the electrostatic potential experienced by 
one ion in the presence of all the other ions in the crystal. We consider a lattice 
made up of ions with positive or negative charges and shall assume that the 
ions are spherical. 

We compute the total potential <p = <p 1 + <p 2 at an ion as the sum of two dis- 
tinct but related potentials. The potential <p y is that of a structure with a Gaussian 
distribution of charge situated at each ion site, with signs the same as those of 
the real ions. According to the definition of the Madelung constant, the charge 
distribution on the reference point is not considered to contribute to the poten- 
tial ipi or tp 2 (Fig. la). We therefore calculate the potential <p 1 as the difference 

<Pi = ‘Pa- <Ph 



<b) 

Figure I (a) Charge distribution used for computing potential tp t ; the potential <p a is computed (it 
includes the dashed curve at the reference point), while i p h is the potential of the dashed curve 
alone, (b) Charge distribution for potential <p,. The reference point is denoted by an X, 
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of two potentials, <p a being the potential of a continuous series of Gaussian dis- 
tributions and (p b being the potential of the single Gaussian distribution on the 
reference point. 

The potential <p 2 is that of a lattice of point charges with an additional 
Gaussian distribution of opposite sign superposed upon the point charges 
(Fig. lb). 

The point of splitting the problem into the two parts <pi and is that by a 
suitable choice of the parameter determining the width of each Gaussian peak 
we can get very good convergence of both parts at the same time. The Gaussian 
distributions drop out completely on taking the sum of the separate charge 
distributions giving rise to y>, and <p 2 > so that the value of the total potential <p is 
independent of the width parameter, but the rapidity of convergence depends 
on the value chosen for that parameter. 

We calculate first the potential < p a of a continuous Gaussian distribution. 
We expand <p a and the charge density p in Fourier series: 

<Pa = 2 c cexp(zG • r) ; (1) 

G 

p = ^p G ex p(iG-r), (2) 

G 

where G is 2tt times a vector in the reciprocal lattice. The Poisson equation is 

V 2 <P a = -4:77 p , 
or 

2 G 2 c c exp(iG - r) = 4tt 2 p c exp(£G * r) , 

so that 

c G =4irp G /G 2 . (3) 

We suppose in finding p c that there is associated with each lattice point of 
the Bravais lattice a basis containing ions of charge q t at positions r, relative to 
the lattice point. Each ion point is therefore the center of a Gaussian charge 
distribution of density 

p(r) = <7t( i?/rr )' V2 exp( —rjr 2 ) , 

where the factor in front of the exponential ensures that the total charge associ- 
ated with the ion is q t ; the range parameter 17 is to he chosen judiciously to ensure 
rapid convergence of the final result (6), which is in value independent of 77. 

We would normally evaluate p G by multiplying both sides of (2) by 
exp (— iG • r) and integrating over the volume A of one cell, so that the charge 
distribution to be considered is that originating on the ion points within the 
cell and also that of the tails of the distributions originating in all other cells. It 
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is easy to see, however, that the integral of the total charge density times 
exp[ — (iG • r)] over a single cell is equal to the integral of the charge density 
originating in a single cell times exp[— (iG ■ r)] over all space. 

We have therefore 


p G I exp(iG • r)exp(— iG • r ) dr — p G A 

one 
cell 

= J 2q ( (rj/irf /2 exp[— rj(r — r t f]exp(— iG • r) dr . 

all ' 
space 

This expression is readily evaluated: 

Pg A = 2 9 /exp(-iG • r t ) (r\/Tr) m f exp[-(i'G • £ + r)£ 2 )! d£ 

t £ 

-(?' 


■A\ 

space 


= I 2j </<exp(— iG • r f ) 1 exp(-G 2 /4i7) = S(G)exp(-G 2 /4r}) 


where S(G) = 2 q f exp(— iG • r ( ) is just the structure factor (Chapter 2) in 
appropriate units. Using (1) and (3), 

% = y S S(G)G“ 2 exp(iG • r - G 2 /4 tj) . (4) 

A G 

At the origin r = 0 we have 

% = cr E S(G)G- 2 exp(-G 2 /47j) . 

G 

The potential <Ph at tlie reference ion point i due to the central Gaussian 
distribution is 

<p b = [ (47 ti 2 dr)(p/r) = Zc/^/tt) 1 ^ , 

and so 

9i(i) = v2 S(G)G _2 exp(-G 2 /47j) - 2^(17 An -) 1 ' 2 . 

& G 

The potential <p 2 is to be evaluated at tbe reference point, and it differs 
from zero because other ions have the tails of their Gaussian distributions 
overlapping the reference point. The potential is due to three contributions 
from each ion point: 



hSo p(T)dr ~l r ^r dr 
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where the terms are from the point charge, from the part of the Gaussian dis- 
tribution lying inside a sphere of radius r f about the /tli ion point, and from 
that part lying outside the sphere, respectively. On substituting for p(r) and 
carrying out elementary' manipulations, we have 

<P2 = X % F ( r n 1 %) , (5) 

where 

F(x) = (2/tt 1/2 ) f exp( — s z ) ds . 

J X 

Finally, 

<p(i) = -yr £ S(G)G _2 exp(-G 2 /4Tj) - 2q l (y/rr) m + £ F(rj 1/2 r,) (6) 

A G 1 1 

is the desired total potential of the reference ion i in the field of all the other 
ions in the crystal. In the application of the Ewald method the trick is to 
choose 7} such that both sums in (6) converge rapidly. 

Ewald-Komfeld Method for Lattice Sums for Dipole Arrays 

Kornfeld extended the Ewald method to dipolar and quadrupolar array's. 
We discuss here the field of a dipole array at a point which is not a lattice 
point. According to (4) and (5) the potential at a point r in a lattice of positive 
unit point charges is 

<p = (4tt/A) £ S(G)G“ 2 exp[iG • r - G 2 /4r}] + £ F(V^ r,)/r, , (7) 

G l 

where r; is the distance from r to the lattice point l 

The first term on the right gives the potential of the charge distribution 
p = (r)/7r) 3/2 exp( — rjr 2 ) about each lattice point. By a well-known relation in 
electrostatics we obtain the potential of an array of unit dipoles pointing in the 
a direction by taking —d/dz of the above potential. The term under discussion 
contributes 

-(4iri/A) £ S(G)(G 3 /G 2 ) exp[iG • r - G 2 /4rj] , 

G 

and the z component of the electric field from this term is E, — d 2 (p/'dz 2 , or 

-(47r/A)2S(G)(G?/G 2 )exp[iG-r-G 2 /4r7] . (8) 

G 

The second term on the right of (7) after one differentiation gives 
- £ Zj[(F(VV,)/ff) + ( 2/rl)(r)/ir) m exp(-iq rf)] , 

i 



648 


and the z component of this part of the field is 

X (z?[(3 F(Vrjr,)/rf) + (6/rf)(rj/Tr) L ' 2 exp{ -17 rf) 
l 

+ (4/rf){i? J /ir) 1/2 exp(-rpf)] - [(F(V7jr;)/r?) (9) 

4- (2/rf)(7j/7r) V2 exp(-7jrf)]} . 

The total E. is given by the sum of (8) and (9). The effects of any number of 
lattices may be added. 


APPENDIX C: QUANTIZATION OF ELASTIC WAVES: PHONONS 


Phonons were introduced in Chapter 4 as quantized elastic waves. How do 
we quantize an elastic wave? As a simple model of phonons in a crystal, con- 
sider the vibrations of a linear lattice of particles connected by springs. We can 
quantize the particle motion exactly as for a harmonic oscillator or set of cou- 
pled harmonic oscillators. To do this we make a transformation from particle 
coordinates to phonon coordinates, also called wave coordinates because they 
represent a traveling wave. 

Let N particles ot mass M be connected by springs of force constant C and 
length a. To fix the boundary conditions, let the particles form a circular ring. 
We consider the transverse displacements of the particles out of the plane of 
the ring. The displacement of particle s is q s and its momentum is p s . The 
Hamiltonian of the system is 


Jl 

H = 1 


^p s 2 + |c(q s+1 - 



The Hamiltonian of a harmonic oscillator is 


(1) 


1 

2 M 


P 2 +icx 2 


and the energy eigenvalues are, where n = 0, 1, 2, 3, . . . , 






(2) 


(3) 


The eigenvalue problem is also exactly solvable for a chain with the different 
Hamiltonian (1). 

To solve (1) we make a Fourier transformation from the coordinates p s , q s 
to the coordinates P*, Q^, which are known as phonon coordinates. 
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Phonon Coordinates 

The transformation from the particle coordinates q s to the phonon coordi- 
nates Q k is used in all periodic lattice problems. We let 

q s = N~ V2 2 Q k exp(iksa) , (4) 

consistent with the inverse transformation 


Qk~N q s exp( - iksa ) . 

S 


(5) 


Here the N values of the wavevector k allowed by the periodic boundary con- 
dition q s = q s+N are given by: 


k — 2 Trn/Na ; n = 0, ± 1 , ±2, 




( 6 ) 


We need the transformation from the particle momentum p s to the momen- 
tum Pjt that is canonically conjugate to the coordinate Q k . The transformation is 


p, = N m 2, P k exp (—iksa); 

k 


P k ~N 1/2 exp(ifcsc) . 

X 


(7) 


This is not quite what one would obtain by the naive substitution of p for q and 
P for Q in (4) and (5), because k and —k have been interchanged between (4) 
and (7). 

We verify that our choice of P k and Q k satisfies the quantum commutation 
relation for canonical variables. We form the commutator 


[ft*] = X 


-1 


( 8 ) 


2 q r exp( ikra), p s exp (ik’sa) 

r s 

= X S k/Vp.J exp [~i{kr - k’s)a ] . 

r a 

Because the operators q, p are conjugate, they satisfy the commutation relation 

[q r,p s ] = ihS(r, s) , (9) 


where S(r,s) is the Kronecker delta symbol. 
Thus (8) becomes 


[Qk,Pk‘] = N~' ih 2 exp[— i(k — k')ra] = ih8(k, k') , (10) 

r 

so that Q k , P k also are conjugate variables. Here we have evaluated the summa- 
tion as 


2 exp [~i(k - k')ra ] = ^ exp[— t2rr(n - n')r/N] 

r r 

— N8(n, n') = N8(k, k') , 


(ID 


where we have used (6) and a standard result for the finite series in (11). 
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We carry out the transformations (7) and (4) on the hamiltonian (1), and 
make use of the summation (11): 

2 Pi = AT 1 2 2 2 p k p k' ex p[ ~i(k + k')sa ] 
s * k k ' ( 12 ) 

= '2'2, p M-W) = '2 p iP-k ; 
k r k 


2 (q s + 1 - q s ) 2 = W 1 2 2 2 exp(iksfl)[exp(iA'«) - 1] 

s s k k ' 

X exp(ik'sa)[exp(ik'a) - 1] = 2 2(?t(?-jfc(l ~ cos for) . (13) 

k 

Thus the hamiltonian (1) becomes, in phonon coordinates, 



jdtf PkP ~ k + _ cos • 


(14) 


If we introduce the symbol <x> k defined by 

o> k ^ (2 C/M ) m (1 - cos ka) m , (15) 

we have the phonon hamiltonian in the form 

H = ^{zM PkP ~ k + 2 M<4 QkQ ~ k ■ ( 16) 

The equation of motion of the phonon coordinate operator Q k is found by 
the standard prescription of quantum mechanics: 

it>Q k = [Q k ,H]=ihP_ k /M , (17) 

with H given by (14). Further, using the commutator (17), 

ihQ k = [ Q k , H] = M~'[P. k ,H] = ihiatQk , (18) 

so that 

Qk + <4 Qk = o • (19) 

This is the equation of motion of a harmonic oscillator with the frequency (o k . 
The energy eigenvalues of a quantum harmonic oscillator are 


e k - 




( 20 ) 


where the quantum number n k = 0, 1, 2, . . . . The energy of the entire system 
of all phonons is 



This result demonstrates the quantization of the energy of elastic waves on 
a line. 
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Creation and Annihilation Operators 

It is helpful in advanced work to transform the phonon hamiltonian (16) 
into the form of a set of harmonic oscillators: 

H = 2 ha> k (ata k + | j . (22) 

Here a k , a k are harmonic oscillator operators, also called creation and destruc- 
tion operators or boson operators. The transformation is derived below. 

The boson creation operator a + which “creates a phonon” is defined by 
the property 

a + |n) = (n + l) 1/2 |n + l) , (23) 

when acting on a harmonic oscillator state of quantum number n, and the boson 
annihilation operator a which “destroys a phonon” is defined by the property 

«]n) = n U2 \n — l) . (24) 

It follows that 

fl + fl|n) = a + n U2 \n — l) = n|n) , (25) 

so that | n) is an eigenstate of the operator a + a with the integral eigenvalue n, 
called the quantum number or occupancy of the oscillator. When the phonon 
mode k is in the eigenstate labeled by n k , we may say that there are n k phonons 
in the mode. The eigenvalues of (22) are U — 2 (n k + | )hu) i , in agreement 
with (21). 

Because 

aa^\n) — a{n +l) 1/2 |n 4- l) = (n + l)|n) , (26) 

the commutator of the boson wave operators at and a k satisfies the relation 

[a, a + ] = aa + — a + a — 1 . (27) 

We still have to prove that the hamiltonian (16) can be expressed as (19) in 
terms of the phonon operators at, a k . This can be done by the transformation 

at = (2 hY y2 [(Mw k ) m Q_ k - i(Maj k )~ m P k ] ; (28) 

a k = (2 h)- m [{M<o k ) m Q k + i(M(o k y m p_ k ] . (29) 

The inverse relations are 

Q k = {mMw k ) m {a k +at k ) ■, (30) 

P k =i(hMw k /2) V2 (at-a- k ) . (31) 

By (4), (5), and (29) the particle position operator becomes 

q s = 2 (h/2NMa> k ) h2 [a k exp (ika) + at exp(— iks)] . 


(32) 
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This equation relates the particle displacement operator to the phonon cre- 
ation and annihilation operators. 

To obtain (29) from (28), we use the properties 

QU = Q k ; n = P- k (33) 

which follow from (5) and (7) by use of the quantum mechanical requirement 
that q s and p s be hermitian operators: 

q, = <-U ; Ps = vt ■ (34) 

Then (28) follows from the transformations (4), (5), and (7). We verify that the 
commutation relation (33) is satisfied by the operators defined by (28) and (29): 

k. <** ] = mr\M<0 k [Q h <?_*.] - i[Q h F k ] + i[p- h Q- k ] 

+ [P_ k ,P k ]/M<o k ) . (35) 

By use of [Q k ,P k j = ih8(k,k') from (10) we have 

k,e£] = S(k,k') . (36) 

It remains to show that the versions of (16) and (22) of the phonon hamil- 
tonian are identical. We note that at k = w_ k from (15), and we form 


hoj k (a k a k + at k a_ k ) = (F k P k +P k P k ) + | Majl(Q k Q_ k + Q. k Q k ) . 


This exhibits the equivalence of the two expressions (14) and (22) for H. We 
identify o> k — (2C/M) 1/2 (1 - cos ka) m in (15) with the classical frequency of 
the oscillator mode of wavevector k. 


APPENDIX D: FERMI-DIRAC DISTRIBUTION FUNCTION 1 


The Fermi-Dirac distribution function 1 may be derived in several steps by 
use of a modern approach to statistical mechanics. We outline the argument 
here. The notation is such that conventional entropy S is related to the funda- 
mental entropy <x by S = k B a, and the Kelvin temperature T is related to the 
fundamental temperature r by r = k B T, where k B is the Boltzmann constant 
with the value 1.38066 X 10 23 J K. 

The leading quantities are the entropy, the temperature, the Boltzmann fac- 
tor, the chemical potential, the Cibbs factor, and the distribution functions. The 


‘This appendix follows closely the introduction to C. Kittel and H, Kroemer, Thermal 
Physics, 2nd ed.. Freeman, 1980 
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entropy measures the number of quantum states accessible to a system. A closed 
system might be in any of these quantum states and (we assume) with equal prob- 
ability. The fundamental assumption is that quantum states are either accessible 
or inaccessible to the system, and the system is equally likely to be in any one ac- 
cessible state as in any other accessible state. Given g accessible states, the en- 
tropy is defined as cr = log g. The entropy thus defined will be a function of the 
energy U, the number of particles N, and the volume V of the system. 

When two systems, each of specified energy, are brought into thermal 
contact, they may transfer energy; their total energy remains constant, but the 
constraints on their individual energies are lifted. A transfer of energy in one 
direction, or perhaps in the other, may increase the product g x g 2 that measures 
the number of accessible states of the combined systems. What wc call the 
fundamental assumption biases the outcome in favor of that allocation of the 
total energy that maximizes the number of accessible states: more is better, 
and more likely. This statement is the kernel of the law of increase of entropy, 
which is the general expression of the second law of thermodynamics. 

We have brought two systems into thermal contact so that they may trans- 
fer energy. What is the most probable outcome of the encounter? One system 
will gain energy at the expense of the other, and meanwhile the total entropy 
of the two systems will increase. Eventually the entropy will reach a maximum 
for the given total energy. It is not difficult to show that the maximum is at- 
tained when the value of (da/dU) yv for one system is equal to the value of the 
same quantity for the second system. This equality property for two systems in 
thermal contact is the property' we expect of the temperature. Accordingly, the 
fundamental temperature r is defined by the relation 


1 

T 



( 1 ) 


The use of I/r assures that energy will flow from high r to low r; no more com- 
plicated relation is needed. 

Now consider a very simple example of the Boltzmann factor. Let a small 
system with only two states, one at energy 0 and one at energy e, be placed in 
thermal contact with a large system that we call the reservoir. The total energy 
of the combined systems is U 0 ; when the small system is in the state of energy 
0, the reservoir has energy [7 0 and will have g (U Q ) states accessible to it. When 
the small system is in the state of energy e, the reservoir will have energy 
Ur — € and will have g (U a — e) states accessible to it. By the fundamental as- 
sumption, the ratio of the probability of finding the small system with energy e 
to the probability of finding it with energy 0 is 


P(e) _ g(U n ~ g) _ exp[o-(U 0 — e)] 
P(0) g(U 0 ) exp [<r( [/<,)] 


( 2 ) 
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The reservoir entropy a may be expanded in a Taylor series: 

(t(U 0 -e) — a(U 0 ) - e(dcr/dU a ) = rr(L/ 0 ) - ©V , (3) 

by the definition (1) of the temperature. Higher order terms in the expansion 
may be dropped. Cancellation of the term exp[<r([; T 0 )], which occurs in the nu- 
merator and denominator of (2) after the substitution of (3), leaves us with 

P(e)/P(0) = exp(— e/r) . (4) 

This is Boltzmann’s result. To show its use, we calculate the thermal aver- 
age energy (e) of the two-state system in thermal contact with a reservoir at 
temperature t: 

, , „ eexp(— e/r) 

(*) - 2 M - 0 • P(0) + eP(e) - 1 + ^p ( _ 6/T) ■ (5) 

where we have imposed the normalization condition on the sum of the 
probabilities: 

P( 0) + P(e) = 1 . (6) 


The argument can be generalized immediately to find the average energy' of a 
harmonic oscillator at temperature t, as in the Planck law. 

The most important extension of the theory is to systems that can transfer 
particles as well as energy with the reservoir. For two systems in diffusive and 
thermal contact, the entropy will be a maximum with respect to the transfer of 
particles as well as to the transfer of energy'. Not only must (dcr/dU) jV V be equal 
for the two systems, but (d<r/dN) vv must also be equal, where A' refers to the 
number of particles of a given species. The new equality' condition is the occa- 
sion for the introduction 2 of the chemical potential p: 


F 

r 



(7) 


For two systems in thermal and diffusive contact, t, = r 2 and p, — p 2 . The 
sign in (7) is chosen to ensure that the direction of particle flow is from high 
chemical potential to low chemical potential as equilibrium is approached. 

The Gibbs factor is an extension of the Boltzmann factor (4) and allows us 
to treat systems that can transfer particles. The simplest example is a system 
with two states, one with 0 particles and 0 energy, and one with 1 particle and 
energy e. The system is in contact with a reservoir at temperature r and chem- 
ical potential p. We extend (3) for the reservoir entropy: 


<r(U 0 - e ; N 0 - 1) = (t(U o; N 0 ) - e(da /dU 0 ) - 1 • (da/dN n ) 
— o-(l/ 0 ;N 0 ) - e/r + p/r . 


( 8 ) 


2 TP Chapter 5 has a careful treatment of the chemical potential 
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By analogy with (4), we have the Gibbs factor 

P(l,e)/P(0,0) = exp[(ju — e)/r] , (9) 

for the ratio of the probability that the system is occupied by 1 particle at 
energy e to the probability that the system is unoccupied, with energy 0. The 
result (9) after normalization is readily expressed as 


P(U) 


1 

exp[(e -,u)/t] + 1 


This is the Fermi- Dirac distribution function. 


( 10 ) 


APPENDIX E: DERIVATION OF THE dk/dt EQUATION 


The simple and rigorous derivation that follows is due to Kroemer. In 
quantum mechanics, for any operator A we have 

d(A)/dt = (i/h)([H,A]) , (1) 

where H is the hamiltonian. See also C. L. Cook, American J. Phys. 55, 953 
(1987). 

We let A be the lattice translation operator T defined by 

Tf(x ) =/(x + a) , (2) 

where a is a basis vector, here in one dimension. For a Bloch function 

Ti/j k (x) = evp(ika){j/ k {x) . (3) 

This result is usually written for one band, hut it holds even if 4>k is a linear 
combination of Bloch states from any number of bands, but having the identi- 
cal wavevector k in the reduced zone scheme. 

The crystal hamiltonian H 0 commutes with the lattice translation operator 


T, so that [Ho.T] = 0. 

If we add a uniform external force F, then 



H = H 0 - Fx , 

(4) 

and 

[H, T] - FaT . 

(5) 

From (1) and (5), 

d(T)/dt - ( i/h)(Fa)(T > . 

(6) 
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From (6) we form 


(T)*d{T)/dt = (iFa/h)\(T)\ z • 

(r)d(T*)/dt = - (iFa!h)\(T )\ 2 . 

On addition, 

d\(T)\ 2 /dt = 0 . (7) 

This is the equation of a circle in the complex plane. The coordinate axes 
in the plane are the real and imaginary parts of the eigenvalue exp (ika). If (T) 
is initially on the unit circle, it will remain on the unit circle. 

For fas that satisfy periodic boundary conditions, { T ) can lie on the unit 
circle only if ip k is a single Bloch function or a superposition of Bloch functions 
from different bands, but with the same reduced k. 

As (T) moves around the unit circle, the wavevector k changes exactly at 
the same rate for the components of fa in all bands. With ( T ) = exp (ika), w r e 
have from (6) that 


ia dk/dt — iFa/h , 

(8) 

dk/dt = F/h , 

(9) 


an exact result. 

This does not mean that interhand mixing (such as Zener tunneling) does 
not occur under the influence of applied electric fields. It just means that k 
evolves at a constant rate for every component of a wave packet. The result is 
easily extended to three dimensions. 


APPENDIX F: BOLTZMANN TRANSPORT EQUATION 


The classical theory of transport processes is based on the Boltzmann trans- 
port equation. We work in the six-dimensional space of Cartesian coordinates r 
and velocity v. The classical distribution function/(r,v) is defined by the relation 

f(ry)drd\ = number of particles in drdv . (1) 

The Boltzmann equation is derived by the following argument. We con- 
sider the effect of a time displacement dt on the distribution function. The 
Liouville theorem of classical mechanics tells us that if we follow a volume 
element along a flowline the distribution is conserved: 

fit + dt , r + dry + d\) — fitjy) , 


( 2 ) 
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in the absence of collisions. With collisions 

f(t + dt , r + dr,\ + d\) —fit, r,v) — dt(3/73t) coU . (3) 

Thus 

dt(dffdt) + dr • grad,, f + dv • grad, / = dt(df/dt ) co D . (4) 

Let a denote the acceleration d\/dt; then 

(5) 

This is the Boltzmann transport equation. 

In many problems the collision term (d/7dt) cal] may be treated by the intro- 
duction of a relaxation time T e (r,v), defined by the equation 

(dfidt) ccii= ~{f~fo)K ■ ( fi ) 

Here/o is the distribution function in thermal equilibrium. Do not confuse t c 
for relaxation time with r for temperature. Suppose that a non equilibrium dis- 
tribution of velocities is set up by external forces which are suddenly removed. 
The decay of the distribution towards equilibrium is then obtained from (6) as 

d(f~/o) _ f~fo m 

dt T c • [ ' 

if we note that df^/dt — 0 by definition of the equilibrium distribution. This 
equation has the solution 

(/ - fo) t = (/ - fo)t=o exp( ~t /T c) • (8) 

It is not excluded that t c may be a function of r and v. 

We combine (1), (5), and (6) to obtain the Boltzmann transport equation 
in the relaxation time approximation: 

(9) 

In the steady state df/dt = 0 by definition. 

Particle Diffusion 

Consider an isothermal system with a gradient of the particle concentra- 
tion. The steady-state Boltzmann transport equation in the relaxation time ap- 
proximation becomes 

v x df/dx = - (f-f 0 )/r c , 




( 10 ) 
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where the nonequilibrium distribution function /varies along the x direction. 
We may write (10) to first order as 

fi=fo~ v x T c df 0 /dx , (11) 

where we have replaced df/dx by df ( /dx. We can iterate to obtain higher order 
solutions when desired. Thus the second order solution is 

U = /o - v^jfjdx =/„ - iKT/lfc'/dx + v y c d%/dx 2 . (12) 

The iteration may be used in the treatment of nonlinear effects. 


Classical Distribution 

Let/, be the distribution function in the classical limit: 

fo = ex p[(f t ~ e)/r] . (13) 

We are at liberty to take whatever normalization for the distribution function 
is most convenient because the transport equation is linear in /and/,. We can 
take the normalization as in (13) rather than as in (1). Then 


d,f Q /dx — idf () /djj,)(d pJdx ) = (f 0 /r)(dfi/dx) , (14) 

and the first order solution (11) for the nonequilibrium distribution becomes 

f~fo~ (v x r c f 0 /T)(d/jL/dx) . (15) 

The particle flux density in the x direction is 

r n =jvJD(e)de , (16) 

where D(e) is the density of electron states per unit volume per unit energy 
range: 

1 / 2M\ 3/2 


D(e) 


,1/2 


27T 2 \h 2 / 


Thus 


Jl = J v I f 0 D(e)de - (dfi/dx) j ( v 2 T c f 0 /T)D(e)de . 


(17) 


(18) 


The first integral vanishes because v T is an odd function and f 0 is an even func- 
tion of v x . This confirms that the net particle flux vanishes for the equilibrium 
distribution/,. The second integral will not vanish. 

Before evaluating the second integral, we have an opportunity to make use 
of what we may know about the velocity dependence of the relaxation time t c . 
Only for the sake of example we assume that t c is constant, independent of ve- 
locity; t c may then be taken out of the integral: 

Jl= -idp./dx)iT,JT} J v 2 r fP(e)de . 


09) 
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The integral may be written as 

3 J v 2 f 0 D(€)de = (iMv 2 )f 0 D(e)de = nr/M , (20) 

because the integral is just the kinetic energy density |nr of the particles. Here 
ff 0 D(e)de = n is the concentration. The particle flux density is 

J„— —(nT c /M)(dfi/dx) = —(T c T/M)(dn/dx) , (21) 


because 


ft — r log n + constant . (22) 

The result (21) is of the form of the diffusion equation with the diffusivity 

D n = t c t/M = i(u 2 )r c . (23) 

Another possible assumption about the relaxation time is that it is in- 
versely proportional to the velocity, as in r c = l/v, where the mean free path l is 
constant. Instead of (19) we have 

= ~(dfi/dx)(l/T) j (v*/v)foD(e)de , (24) 


and now the integral may be written as 

| J vf 0 D(e)de = | nc , 

where c is the average speed. Thus 

Jl = ~k(lcn/T)(dfL/dx) — — \lc(dn/dx ) , 
and the diffusivity is 

— xlc . 


(25) 


(26) 


(27) 


Fermi-Dirac Distribution 

The distribution function is 


^ exp[(e - + 1 ’ (28) 

To form df 0 /dx as in (14) we need the derivative df 0 /dfi. We argue below 
that 


df 0 /dn — 8(e — fx) , 


(29) 
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at low temperatures r /a. Here 5 is the Dirac delta function, which has the 
property for a general function F(e) that 


F(e)S(e - ti)de = F(/x) 


(30) 


Now consider the integral F(e)(df 0 /dfJi)de. At low temperatures df 0 /d fj. is very 
large for e = /x and is small elsewhere. Unless the function F(e) is very rapidly 
varying near /i. we may take F(e) outside the integral, with the value F(ju,): 


[ F(e)(df 0 /d[i)de — F(fi) [ (V//,/d/x)de = -F(/tt) f (df 0 /de)de 
■>0 ■>(! J o 


= -F( M )[/ 0 (e)]„” = F( M )/o(0) , 


(31) 


where we have used df 0 /dn = —df(/de. We have also used/ 0 = 0 for e = oo. At 
low temperatures/(0) — 1; thus the right-hand side of (31) is just F(fi), consis- 
tent with the delta function approximation. Thus 

df 0 /dx = 8(e — /JL)d/Ji/dx . (32) 

The particle flux density is, from (16), 

Jl = -(d/jJdx)T c jv*8(e - fi)D[e)de , (33) 

where r c is the relaxation time at the surface e = ju. of the Fermi sphere. The 
integral has the value 

kvf(3n/2e t ) - n/m , (34) 

by use of D(fi) — 3n/2e F at absolute zero, where e F = rpnv 2 defines the velocity 
v F on the Fermi surface. Thus 

J x „ = —{nT c /m)dfju/dx . 

At absolute zero /r(0) = (ft 2 /2m)(3ir 2 n) 2/3 , whence 

d^i/dx = [|( h 2 /2m)(3T7^) M /n l/3 ]dn/dx 
— %{e F /n)dn/dx , 

so that (33) becomes 

Jl = — (2r„/3m)e F dn/dx = —\v f t c dn/dx . (37) 

The diffusivity is the coefficient of dn/dx: 

D n = Iv 2 f t c , (38) 

closely similar in form to the result (23) for the classical distribution of veloci- 
ties. In (38) the relaxation time is to be taken at the Fermi energy. 


(35) 

(36) 
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We see we can solve transport problems where the Fermi-Dirac distribu- 
tion applies, as in metals, as easily as where the classical approximation 
applies. 

Electrical Conductivity 

The isothermal electrical conductivity <x follows from the result for the 
particle diffusivity when we multiply the particle flux density' by the particle 
charge q and replace the gradient dfi/dx of the chemical potential by the gra- 
dient qdtp/dx = ~qE x of the external potential, where E, is the x component of 
the electric field intensity. The electric current density follows from (21): 

J, = (nq 2 rJm)E ; a = nq\/m , (39) 

for a classical gas with relaxation time t c . For the Fermi-Dirac distribution, 
from (35), 

J q = (nq 2 T c /m)E ; a = nq 2 T r Jm . (40) 


APPENDIX G: VECTOR POTENTIAL, FIELD MOMENTUM, 
AND GAUGE TRANSFORMATIONS 


This section is included because it is hard to find the magnetic vector po- 
tential A discussed thoroughly in one place, and we need the vector potential 
in superconductivity. It may seem mysterious that the hainiltonian of a particle 
in a magnetic field has the form derived in (18) below: 

« = 157 (p- ? A ) = + ^ • (1 > 

where Q is the charge; M is the mass; A is the vector potential; and <p is 
the electrostatic potential. This expression is valid in classical mechanics and 
in quantum mechanics. Because the kinetic energy of a particle is not changed 
by a static magnetic field, it is perhaps unexpected that the vector potential 
of the magnetic field enters the hamiltonian. As we shall see, the key is the 
observation that the momentum p is the sum of two parts, the kinetic 
momentum 

Pkin = Afv 

which is familiar to us, and the potential momentum or field momentum 


( 2 ) 
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The total momentum is 


p = Pki„ + Pficid = Mv + Y A , 


and the kinetic energy is 

The vector potential 1 is related to the magnetic field by 

B = curl A . 


(4) 

(5) 

(6) 


We assume that we work in nonmagnetic material so that H and B are treated 
as identical. 


Lagrangian Equations of Motion 

To find the Hamiltonian, the prescription of classical mechanics is clear: we 
must first find the Lagrangian. The Lagrangian in generalized coordinates is 

L=\ Mq 2 - Q<p( q) + ^ q • A(q) . (7) 


This is correct because it leads to the correct equation of motion of a charge in 
combined electric and magnetic fields, as we now show. 

In Cartesian coordinates the Lagrange equation of motion is 


d dL _ dL _ q 
dt dx dx 

and similarly for y and z. From (7) we form 


d dL 


dL 

dx 

dL 

dx 

QdA x 


_ ~ d <p , Q(- dA * . .- aA y , -- 

Q dx ■" ' X Z* y *»■ 


. Q 

— Mx H A. 


dx 


dx 


£4 

dx 


= ~ = Mx + — 

dt dx c dt c 


Q ( d A. 


dA r 


dA r 


dA r 


+ x — i + ii — 1 + z- 
dt dx dlf dz 


Thus (8) becomes 


d<z> Q 
Mx + Q^- + — 
x dx c 


dA r 


dt + y\dy 


dA. dA, 


dx 


+ i| 


£A*_ SA_\ 
dz dx ) 


— 0 


( 8 ) 

(9) 

( 10 ) 

(ID 

( 12 ) 


'For an elementary treatment of the vector potential see E. M. Purcell, Electricity and 
magnetism, 2nd ed., McGraw-Hill, 1984. 
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or 

A\ Q 

M^f = <?£, + MvXBL , 

with 

F = _ 1 ;>A x 

dx c dt ; 


(13) 

(14) 


B = curl A . (15) 

Equation (13) is the Lorentz force equation. This confirms that (7) is correct. 
We note in { 14) that E has one contribution from the electrostatic potential <p 
and another from the time derivative of the magnetic vector potential A. 

Derivation of the Hamiltonian 

The momentum p is defined in terms of the Lagrangian as 

p=i4 = Mq+^A, (16) 

r aq c 

in agreement with (4). The hamiltonian if(p,q) is defined by 

H (pq) -p q - L , (17) 

or 

H = Mr/ 2 + fq-A- |Mq 2 + Q<P ~ ^q * A = ~ (p - + Q<p , (18) 

as in (1). 

Field Momentum 

The momentum in the electromagnetic field that accompanies a particle 
moving in a magnetic field is given by the volume integral of the Poynting vec- 
tor, so that 

Pfield = JdVEXB. (19) 


We work in the nonrelativistic approximation with v < c, where v is the veloc- 
ity of the particle. At low values of vie we consider B to arise from an external 
source alone, but E arises from the charge on the particle. For a charge 
at r'. 


Thus 


E = — V(p ; V z <p = — 4-uQ<5(r — r') . 


P/ = 


4m- 1 ^ X curl A . 


( 20 ) 


( 21 ) 
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By a vector relation we have 

/ dVV<p X curl A = — / dV[ A X curl (V<p) — Adiv V<p — (V<p) divA] . (22) 

But curl (A<p) = 0, and we can always choose the gauge such that div A = 0. 
This is the transverse gauge. 

Thus, we have 

P/ = J JVAV 2 <p = “JdVApS(r-r') =^A . (23) 

This is the interpretation of the field contribution to the total momentum 
p — Mv + QAJc. 


GAUGE TRAN S FO RMATIO N 

Suppose Hip = eip, where 

1 


Let us make a gauge transformation to A' , where 

A' = A + Vy , 


(24) 


(25) 


where y is a scalar. Now B = curl A = curl A', because curl (Vy! = 0. The 
Schrodinger equation becomes 

\2 


What di' satisfies 


i O Q 




with the same e as for ip? Equation (27) is equivalent to 


i (_ Q. Q 


2 M 


We try 


Now 


so that 


P~- A - — Vy) p' — eip' 


ip'= exp(i<^y/ftc)i/» . 


pip' = expu<4y/7m) p i/i 4- c (Vy)exp(tQy/ftc)i/' 


O 

p- T Vy } *P' = exp(i£y/fic)pi/< 


(26) 


(27) 


(28) 


( 29 ) 
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and 


if (f ■ C A - C v *) f * “p(WW aif (p “ 7 A ) * ,, m 

(oUj 

— exp(i^/^c)ei^ . 

Thus d>' = exp(i£fy/ftc)!/f satisfies the Schrodinger equation after the 
gauge transformation (25). The energy e is invariant under the transformation. 

The gauge transformation on A merely changes the local phase of the 
wave function. We see that 


= (/,*</» , (31) 

so that the charge density is invariant under a gauge transformation. 

Gauge in the London Equation 

Because of the equation of continuity in the flow of electric charge we 
require that in a superconductor 

divj = 0 , 

so that the vector potential in the London equation j = must satisfy 

div A = 0 . (32) 

Further, there is no current flow through a vacuum/superconductor interface. 
The normal component of the current across the interface must vanish-. j a = 0, 
so that the vector potential in the London equation must satisfy 

A u = 0 . (33) 

The gauge of the vector potential in the London equation of superconductivity 
is to be chosen so that (32) and (33) are satisfied. 


APPENDIX H: COOPER PAIRS 


For a complete set of states of a two-electron system that satisfy periodic 
boundary conditions in a cube of unit volume, we take plane wave product 
functions 

‘P(k l 5 k 2 ;r 1 ,r 2 ) = expMIq • ly + k 2 • r 2 )] . 

We assume that the electrons are of opposite spin. 


( 1 ) 
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We introduce center-of-mass and relative coordinates: 

R = |(r 1 + r 2 ) ; r = rj - r 2 ; (2) 

K = kj + k 2 ; k = |(kj - k 2 ) , (3) 

so that 

kj • 1 -j + k 2 • r 2 = K • R + k • r (4) 

Thus (1) becomes 

<p(K,k;R,r) = exp(iK • R) exp(ik • r) , (5) 

and the kinetic energy of the two-electron system is 

e K + Ek = + k 2 ) . (6) 

We give special attention to the product functions for which the center-of- 
mass wavevector K = 0, so that k, = -k 2 . With an interaction H l between the 
two electrons, we set up the eigenvalue problem in terms of the expansion 

A'(r) = 2g k exph'k • r) . (7) 

The Schrodinger equation is 

(H 0 + H, - e)x(r) = 0 = 2 k [(E k < - e)g k - + flig k ]exp(ik' ■ r) , (8) 

where Hi is the interaction energy of the two electrons. Here e is the eigenvalue. 
We take the scalar product with exp((k • r) to obtain 

(Ek-e)g k + 2 k g k , (k|H 1 |k') = 0 , (9) 

the secular equation of the problem. 

Now transform the sum to an integral: 

(E - e)g (E) + JdE' g (E')H r (E,E')N(E') = 0 , (10) 

where N(E') is the number of two electron states with total momentum K = 0 
and with kinetic energy in dE' at £', 

Now consider the matrix elements H 1 (E,£') = (klHJk'). Studies of these by 
Bardeen suggest that they are important when the tw'o electrons are confined to 
a thin energy shell near the Fermi surface — within a shell of thickness Eoj d 
above E F , where oj d is the Debye phonon cutoff frequency. We assume that 

H X (E,E') - —V (11) 

for E,E' within the shell and zero otherwise. Here V is assumed to be positive. 
Thus (10) becomes 

(E - e)g(E) = V I*" dE' g(E')N(E') - C , 

J 2 e „ 

with e m ~ e F + fi(o u . Here C is a constant, independent of E. 


(12) 
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From (12) we have 
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(13) 

he, E - € 

(14) 


and 


With N(E') approximately constant and equal to N F over the small energy 
range between 2e„, and 2e F , we take it out of the integral to obtain 


1 = n f v p 




Let the eigenvalue e of (15) be written as 

e — 2e F - A , 


(15) 


(16) 


which defines the binding energy A of the electron pair, relative to two free 
electrons at the Fermi surface. Then (15) becomes 


1 - N F V log 


2e„ 


2e P + A 


= N F V log 


2 hcop + A 


(17) 


or 


lAVpV = 1 og( 1 + 2ficD D /A) . (18) 

This resrdt for the binding energy of a Cooper pair may be written as 

2 ha> D 


expQAVpV) - 1 ' 


(19) 


For V positive (attractive interaction) the energy of the system is lowered by 
excitation of a pair of electrons above the Fermi level. Therefore the Fermi 
gas is unstable in an important way. The binding energy (19) is closely related 
to the superconducting energy gap The BCS calculations show that a high 
density of Cooper pairs may form in a metal. 


APPENDIX I: GINZBURG-LANDAU EQUATION 


We owe to Ginzburg and Landau an elegant theory of the phenomenology 
of the superconducting state and of the spatial variation of the order parameter 
in that state. An extension of the theory by Abrikosov describes the structure 
of the vortex state which is exploited technologically in superconducting mag- 
nets. The attractions of the GL theory are the natural introduction of the 
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coherence length and of the wavefunction used in the theory of the Josephson 
effects in Chapter 12. 

We introduce the order parameter i/r(r) with the property that 

<A*(rWr) = n s (r) , (1) 

the local concentration of superconducting electrons. The mathematical for- 
mulation of the definition of the function tjj( r) will come out of the BCS the- 
ory. We first set up a form for the free energy density F s (r) in a superconduc- 
tor as a function of the order parameter. We assume that in the general vicinity 
of the transition temperature 

F s ( r) = F n - a\4,\ 2 + + (V2.m)\(-ihV-qA/c)i},\ 2 - f M • dB a , (2) 

Jo 

with the phenomenological positive constants a, p, and rn, of which more will 
be said. Here: 

1. is the free energy density of the normal state. 

2. — aii/rl 2 + ip'.ijfl * is a typical Landau form for the expansion of the free 
energy in terms of an order parameter that vanishes at a second-order phase 
transition. This term may be viewed as —an s + \Pn\ and by itself is a mini- 
mum with respect to n s when n s (T) = a/p. 

3. The term in Igrad i//l 2 represents an increase in energy caused by a spa- 
tial variation of the order parameter. It has the form of the kinetic energy in 
quantum mechanics. 1 The kinetic momentum —iftV is accompanied by the 
field momentum —qAJc to ensure the gauge invariance of the free energy, as 
in Appendix G. Here q = — 2e for an electron pair. 

4. The term — JM • dB a , with the fictitious magnetization M = (B - BJ/4ir, 
represents the increase in the superconducting free energy caused by the ex- 
pulsion of magnetic flux from the superconductor. 

The separate terms in (2) will be illustrated by examples as we progress 
further. First let us derive the GL equation (6). We minimize the total free en- 
ergy jdV F s (r) with respect to variations in the function <p(r). We have 

SF s ( r) = [— aji + p\ip\ 2 ij/ + (1/2 m)(—ihV — qA/c)ijj • (ihV — qA/c)8(jj* + c.c.] (3) 

We integrate by parts to obtain 

Jdv (VM va**) = -Jdv (VVW* , (4) 

if 8i(f* vanishes on the boundaries. It follows that 

6 JdV F s = JdV 5{(/*[~a4> + P + (l/2m)(— ihV —qA/c) 2 ip\ + c.c. (5) 


’A contribution of the form IVMI 2 , where M is the magnetization, was introduced hy Landau 
and Lifshitz to represent the exchange energy density in a ferromagnet; see QTS, p. 65. 
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This integral is zero if the term in brackets is zero: 

[{l/2m)(— ihV — qA/c ) 2 — a + /3 |i/f| 2 ]^ = 0 . 


( 6 ) 


This is the Ginzburg-Landau equation; it resembles a Schrodinger equation 
for ip- 

By minimizing (2) with respect to SA we obtain a gauge-invariant expres- 
sion for the supereurrent flux: 


js(r) = -{iqh/2m){tp*^tp — ipVifi*) - (q 2 /mc) tp*tj/A . (7) 


At a free surface of the specimen we must choose the gauge to satisfy the 
boundary condition that no current flows out of the superconductor into the 
vacuum: ft • j s = 0, where ft is the surface normal. 


Coherence Length. The intrinsic coherence length £ may be defined from 
(6). Let A = 0 and suppose that jSli/'l 2 may be neglected in comparison with a. 
In one dimension the GL equation (6) reduces to 

h 2 d 2 tp 

(8) 


This has a wavelike solution of the form exp (ix/£), where £ is defined by 

£ = (h 2 /2ma) li2 . 


(9) 


A more interesting special solution is obtained if we retain the nonlinear 
term jQl^l 2 in (6). Let us look for a solution with ip = 0 at x = 0 and with ip—*ip 0 
as x —> ao. This situation represents a boundary between normal and supercon- 
ducting states. Such states can coexist if there is a magnetic field H c in the nor- 
mal region. For the moment we neglect the penetration of the field into the 
superconducting region: we take the field penetration depth A < £, which de- 
fines an extreme type I superconductor. 

The solution of 

+ , ( 10 ) 

2m ax 

subject to our boundary conditions, is 

i p(x) = (a/p) m tanh(x/V2£) . (11) 

This may be verified by direct substitution. Deep inside the superconductor 
we have </» 0 = (a/j. 3) 1/2 , as follows from the minimization of the terms -~a\tp\ 2 + 
Wl 4 in the free energy. We see from (11) that £ marks the extent of the co- 
herence of the superconducting wavefunction into the normal region. 

We have seen that deep inside the superconductor the free energy is a 
minimum when \ip n \ 2 = a/fi. so that 

F s = F n ~ a 2 / 2/3 = F„ - , 


( 12 ) 
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by definition of the thermodynamic critical field H c as the stabilization free 
energy density of the superconducting state. It follows that the critical field is 
related to a and /3 by 

H c = (4ira 2 /fi) V2 . (13) 

Consider the penetration depth of a weak magnetic field ( B < H, ) into a 
superconductor. We assume that I (i/I 2 in the superconductor is equal to li// 0 l 2 , 
the value in the absence of a field. Then the equation for the supercurrent flux 
reduces to 

js(r) = -(q 2 /mc)\>p 0 f\ , (14) 

which is just the London equation j s (r) = — (c/4ttA 2 )A, with the penetration 
depth 

(15) 

\4irq 2 |<// 0 | 2 / \4Trq 2 aJ 


The dimensionless ratio k = A/£ of the two characteristic lengths is an 
important parameter in the theory of superconductivity. From (9) and (15) 
we find 


K = 


me | 

( P' 

qh ' 

( v 27T / 


(16) 


We now show that the value k = 1A/2 divides type I superconductors 
(k < 1/V2) from type II superconductors (k > 1/V2). 


Calculation of the Upper Critical Field. Superconducting regions nucle- 
ate spontaneously within a normal conductor when the applied magnetic field 
is decreased below a value denoted by H c *. At the onset of superconductivity 
\ip\ is small and we linearize the GL equation (6) to obtain 


2m 


{—ihV — qA/c) 2 ijj = a 


(17) 


The magnetic field in a superconducting region at the onset of superconduc- 
tivity is just the applied field, so that A = B(0,r,0) and (17) becomes 


fi 

2m 




ih— + —x)ijt=aifi . 
, dy c ) 


(18) 


This is of the same form as the Schrodinger equation of a free particle in a 
magnetic field. 

We look for a solution in the form expfijk^i/ + k z z)]<p(x) and find 
( l/2m) [ — h 2 d 2 /dx 2 + h 2 k \ + ( hk y - qBx/c'f]<p = a<p , 


( 19 ) 
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this is the equation for an harmonic oscillator, if we set E = a — (h 2 /2m) 
(k 2 + k:) as the eigenvalue of 

(1/2 m)[-h z d 2 /dx 2 + (q 2 B 2 /c 2 )x 2 - (2hk y qB/c)x]ip — E<p . (20) 


The term linear in x can be transformed away by a shift of the origin from 
0 to x 0 = hk y qB/2mc, so that (20) becomes, with X = x — x 0 . 


£ J? + faqWmcfir 


ip = (E + h 2 ky2m)<p 


( 21 ) 


The largest value of the magnetic field B for which solutions of (21) exist is 
given by the lowest eigenvalue, which is 

| fuo — hqB m!lx /2mc = a — h 2 k 2 J2m , (22) 

where o) is the oscillator frequency cjB/mc. With k z set equal to zero, 

B mas - H c2 = 2 amc/qh . (23) 


This result may be expressed by (13) and (16) in terms of the thermody- 
namic critical field H r and the GL parameter k — A/£: 


H C 2 = 


2 awe 
qh 


H c „ -v /B me 
(4 m* 2 //?) 1/2 % 


I^H r = V2 kH c . (24) 


When A/f > 1/V2, a superconductor has 11 c3 > ll c and is said to be of type II. 
It is helpful to write II c2 in terms of the flux quantum <l> 0 = 2nhc/q and 
— h 2 /2ma: 

„ _ 2 mca <i*o h 2 _ 

qh ' 2irhc * 2 ma? 2rrf ' 

This tells us that at the upper critical field the flux density H c2 in the material 
is equal to one flux quantum per area 2tt£, 2 , consistent with a fluxoid lattice 
spacing of the order of f . 


APPENDIX J: ELECTRON-PHONON COLLISIONS 


Phonons distort the local crystal structure and hence distort the local band 
structure. This distortion is sensed by the conduction electrons. The important 
effects of the coupling of electrons with phonons are 

• Electrons are scattered from one state k to another state k' , leading to elec- 
trical resistivity. 
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• Phonons can be absorbed in the scattering event, leading to the attenuation 
of ultrasonic waves. 

• An electron will carry with it a crystal distortion, and the effective mass of 
the electron is thereby increased. 

• A crystal distortion associated with one electron can be sensed by a second 
electron, thereby causing the electron-electron interaction that enters the 
theory' of superconductivity. 

The deformation potential approximation is that the electron energy e(k) 
is coupled to the crystal dilation A(r) or fractional volume change by 

e(k,r) = e 0 (k) + CA(r) , (1) 

where C is a constant. The approximation is useful for spherical band edges 
e 0 (k) at long phonon wavelengths and low electron concentrations. The 
dilation may be expressed in terms of the phonon operators « q , « q of 
Appendix C by 

A(r)=i]£(ft/2Afw q ) L ' 2 |q|[a q exp(jq-r)-a q exp(-iq-r)] . (2) 

as in QTS, p. 23. Here M is the mass of the crystal. The result (2) also follows 
from (C.32) on forming^, — in the limit k <S 1. 

In the Born approximation for the scattering we are concerned wnth the 
matrix elements of C A(r) between the one-electron Bloch states |k) and |k'}, 
with |k) = exp(zk • r)u k (r). In the wave field representation the matrix ele- 
ment is 

H' = f d 3 x i|f + (r)CA(r)0(r) = 2 c k 'C k {k' |CA|k) 

kk 

= iC^ c\Cv X (ft't2Mw q )' /2 |q|(a q / d 3 x uTu l .e" 1_k +,i; ' r (3) 

k'k q 

-a+I dh ujn k c ,(k “ k '“ ,l) r ) , 

where 


f(r) = 2 c k ip k (r) = 2 e k exp(ik • r>u k (r) , (4) 

k k 

where t k , e k are the fermion creation and annihilation operators. The product 
u k ,-(r)u k (r) involves the periodic parts of the Bloch functions and is itself peri- 
odic in the lattice; thus the integral in (3) vanishes unless 

k-k'+ _f° 

~ 1 vector in the reciprocal lattice. 

In semiconductors at low temperatures only the possibility zero (N processes) 
may be allowed energetically. 
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Let us limit ourselves to N processes, and for convenience we approximate 
/ d z x « k .« lc by unity. Then the deformation potential perturbation is 

H' = iC |q |(a^Ck +q c k - rt q c k q c k ) . (5) 

kq 

Relaxation Time. In the presence of the electron-phonon interaction 
the wavevector k is not a constant of the motion for the electron alone, but 
the sum of the wavevectors of the electron and virtual phonon is conserved. 
Suppose an electron is initially in the state |k); how long will it stay in that 
state? 

We calculate first the probability o> per unit time that the electron in k will 
emit a phonon q. If n q is the initial population of the phonon state, 

tt>(k - q;n q + 1 |k;n q ) = (2777%) |(k - q;n q + 1 1 H' |k;n q )| 2 S(e k — fia> q - e k q ) (6) 

by time-dependent perturbation theory. Here 

|{k - q;n q + 1 |H'|k ; « q )| z = [C 2 hql2Mc s (n q + 1)] . (7) 

The total collision rate W of an electron in the state |k) with a phonon sys- 
tem at absolute zero is, with n q — 0, 

W = iTTfx: / i t ^ cos J 0 dc t 7 3|5 ' e k- e k _ q - hw q ) , (8) 

where p is the mass density'. 

The argument oi the delta function is 

(2k ■ q - q 2 ) - hc s q = A- (2k ■ q - q 2 ~ qq c ) , (9) 

where q v — 2 hm* c s , with c s the velocity of sound. The minimum value of k for 
which the argument can be zero is k miTI = $(q + q c ), which for q = 0 reduces to 
k min = \q c = m*cjh. For this value of k the electron group velocity v g — k min /m* 
is equal to the velocity of sound. Thus the threshold for the emission of 
phonons by electrons in a crystal is that the electron group velocity should ex- 
ceed the acoustic velocity. This requirement resembles the Cerenkov thresh- 
old for the emission of photons in crystals by fast electrons. The electron 
energy at the threshold is |m*c 2 ~ 10 -2 ' • 10" — 10 -lfi erg ~ 1 K. An electron 
of energy below this threshold will not be slowed down in a perfect crystal at 
absolute zero, even by higher order electron-phonon interactions, at least in 
the harmonic approximation for the phonons. 

For k > q c we may neglect the qq c term in (9). The integrals in (8) become 

f dp, f dq q 3 (2m*/h 2 q)8(2kp — q) — (8m*/ft 2 ) f dpk 2 p? = 8m*k 2 /3h 2 , (10) 
J -i Jo 



and the phonon emission rate is 


W(emission) = ^ m \ , (11) 

2>irpc s n 

directly proportional to the electron energy e t . The loss of the component of 
wavevector parallel to the original direction of the electron when a phonon is 
emitted at an angle 6 to k is given by q cos 6. The fractional rate of loss of k z is 
given by the transition rate integral with the extra factor ( q/k ) cos 0 in the inte- 
grand. Instead of (10), we have 

(2 m*/h 2 k) f d/i 8k 3 /i 4 = 16m*k 2 /5h 2 , (12) 

Jo 

so that the fractional rate of decrease of k z is 

W(k z ) = 4C 2 m‘k 2 /5 rrpcji 2 . (13) 

This quantity enters into the electrical resistivity. 

The above results apply to absolute zero. At a temperature k B T S> hc s k the 
integrated phonon emission rate is 

, CSn'kk^T . . 

^(emission) = ; — . (14) 

TTC;ph 3 

For electrons in thermal equilibrium at not too low temperatures the required 
inequality is easily satisfied for the rms value of k. If we take C = 1 0 ! 2 erg; 
m* = KT 27 g; k = 10' cm -1 ; c, = 3 X 10’’ cm s _I ; p — 5 g cm -3 ; then W — 10 12 s _1 . 
At absolute zero (13) gives W — 5 X 10 10 s~ 3 with these same parameters. 
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Absolute thermoelectric 
power, 215 

Acceptor ionization energies, 212 
Acceptor states, 211 
Acoustical phonon branch, 95 
Activation energy, 589 
Adiabatic demagnetization, 312 
Aharonov-Bohm effect, 543 
Alfv6n waves, 425 
Alkali halide crystals, table, 66 
Alloys. 621 
strength, 613 
transition metal, 634 
Alnico V, 353 
Amorphous solids, 568 
ferromagnets, 575 
semiconductors, 577 
Anharmonic interactions, 119 
Anisotropy energy, 348 
Annihilation operators, 651 
Antiferroelectric crystals, 478, 

479 

Antiferromagnetism, 340 
magnons, 344 
N eel temperature, 343 
Anti-stokes line, 445 
Atomic force microscope, 526 
Atomic form factor, 41 
Atomic radii, 70 
table, 71 

Band bending, 507 
Band gap, 165, 187 
Band structure, 
germanium, 203 
Barium titanate, 470 
Basis, 4, 5 

BCS theory, 270, 277 

Biomagnetism, 354 

Bloch equations, 369 

Bloch frequency, 217 

Bloch function, 167 

Bloch oscillator, 217 

Bloch theorem, 173 

Bloch T 3/2 law, 334 

Bloch wall, 349 

Bohr magneton, 303 

Boltzmann transport equation, 656 

Boson operators, 651 


Boundary conditions, 
periodic, 110 
Bragg law, 25 
Bravais lattice, 8 
Brillouin function, 304 
Brillouin scattering, 428 
Brillouin zone, 33, 223, 252 
first, 44, 93, 224 
volume, 44 
Bulk modulus, 80 
Bulk modulus, electron gas, 157 
Burgers vector, 604 
Biittiker-Landauer formalism, 540 


Cauchy integral, 431 
Causality, 450 
Cell, primitive, 6 
unit, 8 

Wigner-Seitz, 34 
Centipoise, 573 
Central equation, 174 
Cesium chloride structure, 14 
Chalcogenide glasses, 577 
Charge density waves, 424 
Charging energy’, 549 
Chemical potential, 137, 157 
Classical distribution, 658 
Clausius-Mossotti relation, 464 
Clogston relation, 272 
Coercive force, small 
particle, 358 
Cocrcivity, 347, 352 
Coherence length, 276, 669 
intrinsic, 277 

Cohesive energy, 49, 59, 73 
Fermi gas, 159 
Sodium metal, 237 
Square well potential, 253 
Cohesive energy, table, 50 
Collisionless electron gas, 433 
Color centers, 592 
Compressibility, 80 
Concentration, table, 21 
Conductance, quantum, 534 
Conduction electron, 
ferromagnetism, 320 
Conduction band edge, 190 
Conduction electrons, 315 
susceptibility, 315 


Conductivity, electrical, 147, 

209, 420 
impurity, 209 
ionic, 420 
thermal, 156 

Conductivity sum rule, 450 
Configurational heat capacity, 640 
Contact hyperfine interaction, 374 
Cooper pairs, 279, 556, 665 
Coulomb blockade, 551 
Covalent bond, 67 
Covalent crystals, 67 
Critical points, 434 
Continuum wave equation, 103 
Creation operators, 651 
Creep, 615 

Critical field, 262, 295 
thin films, 295 
Critical shear stress, 599 
Critical temperature, 
ferroelectric, 469 
Crystallography, surface, 490 
Crystal field, 309 
splitting, 309 

Crystal momentum, 100, 173 
Crystal structure, elements, 20 
Cubic lattices, 10 
Cubic zinc sulfide structure, 17 
Curie constant, 305 
law, 305 

Curie-Weiss law, 324 
Cyclotron frequency, 153 
Cyclotron resonance, 200, 219 
spheroidal energy surface, 219 

Dangling bonds, 488 
Davydov splitting, 452 
Debye model, density of 
states, 112 

Debye temperature, 112 
table, 116 
Debye T‘ law, 114 
Debye-Waller factor, 642 
Defects, paramagnetic, 375 
Deficit semiconductors, 209 
Degenerancy, 135 
Degenerate semiconductor, 409 
De Haas-van Alphen effect, 244 
period, 253 
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Demagnetization factors, 380 
Demagnetization, 
isentropic, 312 
Density of states, 108, 149 
Debye model, 112 
Einstein model, 114 
finite system, 520 
general result, 117 
one dimension, 108 
singularity, 119 
three dimensions, 11 1 
Density, table, 21 
Depolarization factors, 458 
Depolarization field, 485 
Diamagnetism, 299 
Diamond structure, 16, 45, 

182, 187 

Dielectric constant, 463 
semiconductors, 211 
susceptibility, 459 
Dielectric function, 429 
electron gas, 433 
Lindhard, 406 
Tliomas-Fermi, 405 
Diffraction conditions, wave, 25 
Diffraction, Josephson 
junction, 282 
Diffusion, 371, 588 
Diffusivity, 588 
Dilation, 75 

Direct gap semiconductor, 
table, 201 

Direct photon absorption, 189 
Dislocation densities, 610 
Dislocation multiplication, 611 
Dislocations, 601 
Dispersion relation, phonons, 92 
electromagnetic waves, 92, 397 
Displacive transition, 471 
Dissipation sum rule, 450 
Distribution, classical, 130 
Fermi-Dirac, 107 
Planck, 107 
Dog’s bone orbit, 250 
Domains, origin of, 350 
closure, 35f 

Donor ionization energies, 21 1 
Donor states, 209 
Doping, 209 

Dulong and Petit value, 1 1 7 


Edge dislocation, 601 
Effective mass, 197 
negative, 199 

semiconductors, table, 201 


Einstein model, density of 
states, 114 
thermal, 145 
Elastic stiffness, 78, 84 
Elastic strain, 73 
Elastic wave quantization, 80 
Electrical resistivity, 148 
table, 149 

Electric field, local, 460 
macroscopic, 456 
Electric conductivity, 147, 661 
table, 149 

Electric quadruple moment, 379 
Electronic polarizabilities 
table, 465 

Electronic structure, surface, 494 
Electron affinities, 62 
Electron beam lithography, 521 
Electron compound, 624, 625 
Electron-electron collisions, 417 
Electron heat capacity 7 , table, 148 
Electron-hole drops, 441, 443 
Electron-electron interaction, 417 
one dimension, 532 
Electron-lattice interaction, 420 
Electron orbits, 230 
Electron-phonon collisions, 671 
Electron spin resonance, 362 
Electron work functions, 494 
tabic, 494 

Electrostatic screening, 403 
Elementary excitations, 90 
Empirical pseudopotential 
method, 239 
Empty core model, 240 
Empty lattice approximation, 176 
Energy 7 band calculation, 232 
Energy bands, 163, 164 
Energy gap, 165, 167 
superconductors, 266, 268 
table, semiconductors, 189 
Energy levels, one dimension, 134 
Energy loss, fast particles, 448 
Entropy, superconductors, 265 
Entropy of mixing, 631 
Equations of motion, 
electron, 191 

Equation of motion, hole, 196 
ESCA, 447 
Eutectics, 632 

Ewald-Komfeld method, 647 
Ewald construction, 32 
sphere, 493 
sums, 644 

Exchange energy, 326 
field, 325 


frequency resonance, 391 
integral, 325 
interaction, 325 
narrowing, 371, 386 
Exclusion principle, Pauli, 56 
Exotic superconductors, 147 
Extremal orbits, 248 
Excitons, 435 

Extended zone scheme, 226 
Extinction coefficient, 429 
Extremal orbits, 248 

Factor, atomic form, 41 
structure, 45 
F centers, 376, 592, 595 
Fermi-Dirac distribution, 

136, 652 

Fermi energy, 135 
Fermi gas, 134 
Fermi level, 137 
Fermi liquid, 417 
Fermi surface, copper, 249 
Fermi surface, gold 
Fermi surface parameters, 
table, 139 

Fermi surfaces, 225 
Fermions, heavy, 147 
Ferroelectric crystals, 467 
domains, 479 
linear array, 485 
Ferromagnetism, crystals, 

321, 328 
amorphous, 575 
conduction electron, 320 
domains, 345 
order, 336 
resonance, 379 
Fiber optics, 581 
Fick’s law, 588 
Fine structure constant, 499 
First Brillouin zone. 36, 93, 165 
First-order transition, 477 
Flaxoid, 281 
Flux quantization, 279 
Fourier analysis, 27, 39, 169 
Fractional quantized Hall effect 
(FQHE), 503 
Frank bead source, 612 
F ree-energy, 
superconductors, 267 
Frenkel defect, 566, 595 
Frenkel cxciton, 437 
Friedel oscillations, 407 
Fullerenes, 262 
Fused quartz, 570 
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Gap, direct, 190 
Gap, indirect, 190 
Gap plasmons, 426 
Gauge transformation, 664 
Geomagnetism, 354 
Ginzburg-Landau equation, 667 
Ginzburg-Landau parameter, 667 
GLAG theory, 283 
Glass, 573 

transition temperature, 573 
Grain boundary, small-angle, 607 
Group velocity, 94 
Griinciscn constant, 129 
Gyromagnetic ratio, 302 
Gyroscopic equation, 366 

Hagen-Rubens relation, 451 
Hall coefficient, 154 
table, 155 
Hall effect, 153 
ballistic junction, 542 
two-carrier types, 218 
Hall resistance, 155 
Hall resistivity, 498 
Hardness, 617 
Heavy fermions, 147 
Heat capacity, configurational, 640 
electron gas, 141 
glasses, 378 
metals, 145 
one dimension, 561 
phonon, 107 
superconductors, 264 
He 3 liquid, 158 
Heisenberg model, 325 
Helicon waves, 425 
Heterojunction, 508 
Heterostnictures, 507 
Hexagonal close-packed 
structure, 15 
Hexagonal lattice, 44 
High temperature super- 
conductors, 293 
Hole, equation of motion, 

104, 194 
Hole orbits, 230 
Hole trapping, 422 
Hooke’s law, 73 
HTS, 293 

Hume-Rothery rules, 624 
Hund rules, 306 
Hydrogen bonds, 70 
Hyperfine constant, 375 
Hyperfine effects, ESR in 
metals, 391 


Hyperfine splitting, 373 
Hysteresis, 352 

Ilmenite, 470 
Impurity conductivity, 209 
Impurity orbits, 218 
Index system, 10 
Indirect gap, 190 
Indirect photon absorption, 189 
Inelastic scattering, phonons, 100 
Inert gas crystals, table, 53 
Injection laser, 510 
Insulators, 181 
Interband transitions, 434 
Interface plasmons, 425 
Interfacial polarization, 484 
Intrinsic carrier concentration, 
188, 205 

Intrinsic coherence length, 277 
Intrinsic mobility, 208 
Intrinsic semiconductors, 187 
Inverse spinel, 338 
Ionic bond, 60 
Ionic character, table, 69 
Ionic conductivity, 420, 

Ionic crystals, 6 
Ionic radii, 72 
table, 71 

Ionization energies, acceptor, 209 
donor, 209 
table, 54 
Iron garnets, 339 
Iron group, 307 

Isentropic demagnetization, 312 
Isotope effect, 269 
superconductors, 269 
Isotope effect, thermal 
conduction, 127 


Jahn-Teller effect, 209 
trapping, 420 
Josephson tunneling, 289 
Jump frequency, 590 
Junction, superconducting, 287 
Junctions, p-n, 503 

k • p perturbation theory, 253 
Kelvin relation, 215 
Knight shift, 377, 378 
Kohn anomaly, 103 
Kramers- Heisenberg 
dispersion, 431 

Kramers-Kronig relations, 430, 432 


Kronig-Penney model, 174, 182 
reciprocal space, 174 

LA modes, 95 
Lagrangian equations, 662 
Landauer formula, 535 
Landau gauge, 503 
Landau-Ginzburg equations, 276, 
667 

Landau level, 245, 254, 493 
Landau theory, Fermi liquid, 417 
Landau theory, phase 
transition, 474 
Langevin diamagnetism, 299 
Langevin result, 301 
Lanthanide group, 306 
Larmor frequency, 300 
Larmor theorem, 300 
Laser, 389 
injection, 510 
ruby, 389 

semiconductor, 510 
Lattice, Bravais, 8 
Lattice constants, equilibrium, 58 
Lattice frequencies, table, 41 6 
Lattice momentum, 193 
Lattice sums, dipole arrays, 647 
Lattice types, 8,10 
Lattice vacancies, 585 
Lattices, cubic, 10 
Laue equation, 33, 513, 641 
Law of mass action, 206 
LCAO approximation, 233 
LEED, 511 

Lennard-Jones potential, 58 
Lenz’s law, 299 

Lindhard dielectric function, 406 
Line width, 370 
Liquid He' 1 , 158 
Liquidus, 632 
Localization, 539 
Local electric field, 460 
London equation, 273, 665 
London gauge, 665 
London penetration depth, 

275, 294 

Longitudinal plasma 
oscillations, 398 

Longitudinal relaxation time, 366 
Long-range order, 630 
LO modes, 95 
Liorentz field, 462 
Lorenz number, 157 
Low-angle grain boundary, 607 
LST relation, 414, 416 
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Luminescence, 437 
Luttinger liquid, 532 
Lyddane-Sachs-Tellcr relation, 414 

Madelung constant, 64 
Madelnng energy, 60 
Magnetic breakdown, 251 
Magnetic force microscopy, 

355, 527 

Magnetic susceptibility, 298, 
315,318 
Magnetite, 337 
Magnetoconductivity, tensor, 

159, 254 

Magnetocrystalline energy, 347 
Magnctoelastic coupling, 357 
Magnetogyric ratio, 302, 363 
Magneton numbers, 308 
Magnetoplasma frequency, 425 
Magnetoresistance, giant, 359 
Magnetoresistance, 498 
two carrier types, 219 
Magnetic resonance, 361 
Magnetotaxis, 355 
Magnetization, 303 
saturation, 304 
Magnons, 330 
antiferromagnetic, 344 
dispersion relation, 357 
Maser action, 386 
Matthiessen’s rule, 150 
Maxwell equations, 455 
Mean field approximation, 323 
Meissner effect, 259 
sphere, 296 

Melting points, table, 51 
Melt-spin velocity, 576 
Mesh, 490 

Mesoscopic regime, 543 
Metal-insulator transition, 407 
Metals, 69 
Metal spheres, 484 
Metglas, 576 
Microelectromechanical 
systems, 561 
Miller indices (see index 
system), 11 
Mobility edges, 501 
Mobility gap, 501 
Mobility, intrinsic, 208 
Mobilities, table, 208 
Molecular crystals, 440 
Molecular hydrogen, 68, 86 
Momentum crystal, 100, 173 
field, 661 


lattice, 193 
phonon, 100 
MOSFET, 497 

Motional narrowing, 371, 373 
Mott exciton, 441 
Mott transition, 407 
Mott-Wannier excitons, 441 


np product, 206 
Nanocrystals, 517 
fluorescence, 522 
energy levels, 545 
Nanostructures, 517 
Nanotubes, 518 
density of states, 529 
band structure, 562 
Neel temperature, 341, 342, 343 
Neel wall, 358 
Negative effective mass, 199 
Neutron diffraction, 45 
NMR tomography, 363 
Noncrystalline solid, 519 
Nonideal structures, 18 
Normal mode enumeration, 108 
Normal processes, 124, 125 
Normal spinel, 338 
Nuclear demagnetization, 314 
Nuclear magnetic resonance, 363 
table, 365 

Nuclear magneton, 364 
Nuclear paramagnetism, 314 
Nuclear quadrupole resonance, 379 


Ohm’s law, 147, 538 
Open orbits, 230 
magnetoresistance, 254 
Optical absorption, 190, 521 
Optical microscopy, 521 
Optical phonon branch, 95 
Optical phonons, soft, 473 
Orbit, dog’s bone, 250 
Order-disorder transformation, 627 
Order, long-range, 630 
short-range, 631 
Order parameter, 668 
Oscillations, Friedel, 407 
Oscillator strength, 466 


p-n junctions, 503 
Paramagnetic defects, 375 
Paramagnetism, 302 
conduction electrons, 315 
Van Vleck, 311 


Particle diffusion, 657 
Pauli exclusion principle, 56 
Pauli spin magnetization, 316, 319, 
377 

Peierls instability, 422, 532 
insulator, 424 
Peltier coefficient, 215 
Penetration depth, London, 296 
Periodic boundary conditions, 110 
Periodic zone scheme, 225 
Perovskite, 470 
Persistent currents, 282 
Phase diagrams, 625, 632 
Phase transitions, structural, 467 
Phonons, 100, 101, 107 
coordinates, 649 
dispersion relations, 117 
gas, thermal resistivity, 123 
heat capacity, 107 
inelastic scattering, 100 
mean free path, 122 
metals, 409 
modes, soft, 103 
momentum, 100 
Photolithography, 522 
Photovoltaic detectors, 506 
Piezoelectricity, 481 
Planck distribution, 107 
Plasmon frequency, 90, 397, 398 
interface, 425 
mode, sphere, 425 
optics, 396 
oscillation, 398 
surface, 403, 424 
Poise, 573 

Poisson equation, 403 
Poisson’s ratio, 87 
Polaritons, 410 
Polarizability, 463 

conducting sphere, 484 
electronic, 464 
Polarization, 455 
interfacial, 484 
saturation, 484 

Polaron, coupling constant, 420, 
422, 426 
Polytypism, 19 
Power absorption, 370 
Primitive cell, 5, 6, 180 
p-n junctions, 503 
Pseudopotential, 

components, 239 
metallic sodium, 240 
method, 239 
Pyroelectric, 469 
P'ZT system, 479, 481 
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Quantization, elastic wave, 

99, 648 

orbits in magnetic field, 242 
spin waves, 382 
Quantum corral, 525 
Quantum dots, 517, 545 
charge states, 549 
Quantum interference, 282, 292 
Quantum solid, 85 
Quantum theory, 
diamagnetism, 301 
paramagnetism, 302 
Quasi-Fermi levels, 510 
Quasi particles, 417 
Quenching, orbital angular 
momentum, 308 
QHE, 499 

Radial breathing mode, 558 
Radial distribution function, 569 
Raman effect, 444 
scattering, 428 
surface enhanced, 549 
nanotubes, 558 
Random network, 572 
Random stacking, 19 
Rare earth ions, 305 
Rayleigh attenuation, 582 
Reciprocal lattice points, 29 
Reciprocal lattice vectors, 29 
Recombination radiation, 442 
Reconstruction, 489 
Rectification, 504 
Reduced zone scheme, 223 
Reflectance, 430 

Reflection, normal incidence, 450 
Reflectivity coefficient, 429 
Refractive index, 429 
Relaxation, 366 
direct, 368 
Orbach, 368 
Raman, 368 
Relaxation time, 
longitudinal, 366 
spin-lattice, 366 
Remanence, 347 
Resistance per square, 159 
Resistance quantum, 534 
Resistance, surface, 159 
Resistivity, electrical, 147, 159 
Resistivity ratio, 150 
Resonant tunneling, 538 
Response, electron gas, 426 
Response function, 430 
Rf saturation, 391 


RHEED, 493 
Richardson-Dushman 
equation, 495 
RKKY theory, 638 
Rotating coordinate system, 391 
Ruby laser, 389 

Saturation magnetization, 326 
Saturation polarization, 485 
Saturation rf, 391 
Scanned probe microscopy, 520 
Scanning electron microscope, 521 
Scanning tunneling 
microscope, 523 
Schottky harrier, 506 
Schottky defect, 585 
Schottky vacancies, 585, 595 
Screened Coulomb potential, 406 
Screening, electrostatic, 403 
Screw dislocation, 603 
Second harmonic generation, 549 
Second-order transition, 475 
Self-diffusion, 591 
Self-trapping, 209 
Semiconductor crystals, 187 
Semiconductor, degenerate, 409 
Semiconductor lasers, 510 
Semiconductors, deficit, 209 
Semimetals, 162, 215 
Shear constant, 87 
Shear strength, single crystals, 599 
Shear stress, critical, 599 
Short-range order, 631 
Single-domain particles, 353, 358 
Single-electron transistor, ,551 
Singularities, Van Hove, 119 
Slatcr-Pauling plot, 636 
Slip, 600 

Sodium chloride structure, 13 

Sodium metal, 132 

Soft modes, 474, 485 

Solar cells, 506 

Solidus, 632 

Solubility gap, 632 

Spheres, metal, 484 

Spectroscopic splitting factor, 31 1 

Spinel, 337 

Spin-lattice interaction, 367 
Spin-lattice relaxation time, 366 
Spin wave (see also 
magnon), 330 
quantization, 333 
resonance, 382 
Square lattice, 8, 182 
Stability criteria, 88 


Stabilization free energy, 272 
Stacking fault, 601 
STM, 523 
Stokes line, 445 
Strain component, 75 
Strength of alloys, 613 
Stress component, 75 
Structural phase transitions, 467 
Structure factor, 39 
bcc lattice, 40 
diamond, 45 
fee lattice, 40 
Suhstrate, 489 

Surface plasmon resonance, 547 
Superconductivity, table, 261 
type I, 259 
type II, 283 
Superlattices, 628, 640 
Superparamagnetism, 354 
Surface crystallography, 490 
surface electronic structure, 494 
surface nets, 490 
surface plasmons, 279, 302 
surface resistance, 159 
surface states, 495 
surface transport, 497 
Susceptibility, dielectric, 459 

TA modes, 95 
Temperature, Debye, 112 
Temperature dependence, 
reflection lines, 643 
Tetrahedral angles, 22 
Thermal conductivity, 121, 156 
glasses, 534 
isotope effect, 127 
metals, 156 
one-dimension, 561 
tabic, 116 

Thermal dilation, 128 
Thermal effective mass, 145 
Thermal excitation, magnons, 334 
Thermal expansion, 120 
Thermal ionization, 213 
Thermal resistivity, phonon 
gas, 133 

Thermionic emission, 495 
Thermodynamics, 

superconducting transition, 270 
Thermoelectric effects, 214 
Thomas-Fermi approximation, 403 
Thomas-Eermi dielectric 
function, 405 
Three-level maser, 388 
Tight-binding method, 232 
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TO modes, 95 
Tomography, magnetic 
resonance, 363 
Transistor, MOS, 507 
Transition, displacive, 471 
first-order, 477 
metal insulator, 107 
order-disorder, 471 
second-order, 475 
Transition metal alloys, 634 
Transition temperature, glass, 573 
Transistor, MOS, 497 
Translation operation, 4 
Translation vector, 4 
Transmission electron 
microscopy, 520 
Transmission probability, 534 
Transparency, alkali metals, 398 
Transport, surface, 497 
Transverse optical modes, 
plasma, 398 

Transverse relaxation time, 368 
Triplet excited states, 318 
Tunneling, 287 
Josephson, 289 


Tunneling probability, 524 
Twinning, 601 
Two- flu id model, 295 
Two-level system, 320 
Type I superconductors, 264 
Type II superconductors, 264, 283 


Ultraviolet transmission 
limits, 399 

Umklapp processes, 125 
Unit cell, 6 

Upper critical field, 670 
UPS, 447 


Valence band edge, 190 
Van der Waals interaction, 53 
Van Hove singularities, 119, 528 
Van Vleck paramagnetism, 311 
Vector potential, 661 
Vickers hardness number, 618 
Viscosity, 574 
Vitreous Silica, 570 
Vortex state, 264, 284, 295 


Wannier functions, 254 
Wave equation, continuum, 103 
periodic lattice, 169 
Weiss field, 323 
Whiskers, 616 
W'iedemann-Franz law, 156 
Wigner-Seitz boundary 
condition, 237 

Wigner-Seitz cell, 6, 8, 34, 238 
Wigner-Seitz method, 236 
Work function, 494 
Work-hardening, 614 

XPS, 447 


Young’s modulus, 87 
Yttrium iron garnet, 381 

Zener breakdown, 217 
Zener tunneling, 217 
Zero-field splitting, 386 
Zero-point motion, 56, 85 



Table of Values 


Quantity 

Symbol 

Value 

CCS 

SI 

Velocity of light 

c 

2.997925 

10'° cm s 1 

10 8 m s 1 

Proton charge 

e 

1.60219 

4.80325 

JO" 10 esu 

io- 19 C 

Planck’s constant 

h 

6.62620 

IO' 27 erg s 

IO" 34 J s 


fi — h/2-ir 

1.05459 

10" 27 erg s 

10“ 34 J s 

Avogadro’s number 

N 

6.02217 x 10 23 moP 1 





Atomic mass unit 

amu 

1.66053 

10 24 g 

10 27 kg 

Electron rest mass 

m 

9.10956 

10 28 g 

10 3 ‘ kg 

Proton rest mass 

M P 

1.67261 

10- 24 g 

10- 27 kg 

Proton mass/electron mass 

M p lm 

1836.1 

— 

— 

Reciprocal fine structure 

1 la 

137.036 





constant fide 2 

Electron radius e 2 imc l 


2.81794 

10 13 cm 

IO' 15 m 

Electron Compton 

K 

3.86159 

10- " cm 

IO' 13 m 

wavelength hlmc 

Bohr radius fc 2 /me 2 

r a 

5.29177 

10 9 cm 

10 11 m 

Bohr magneton ehl2mc 

H-b 

9.27410 

10 21 erg G-‘ 

10 24 ] T"‘ 

Rydberg constant me i l2fi 2 

floe or Ry 

2.17991 

13.6058 eV 

10" erg 

10 18 J 

1 electron volt 

eV 

1.60219 

10- 12 erg 

10-' 9 J 


eVlh 

2.41797 X 10 14 Hz 

— 

— 


cVlhc 

8.06546 

10 3 cm * 

10 5 rn" 1 


eV/k fl 

1.16048 x 10 4 K 

- 

- 

Boltzmann constant 

^ B 

1.38062 

IO" 16 erg K~‘ 

10-23 J K 1 

Permittivity of free space 

Co 

- 

1 

10 7 /4ttc 2 

Permeability of free space 

Po 


1 

4v x 10” 7 


Source. B. N. Taylor, W, H. Parker, and D N. Langenbcrg, Hev. Mud. Phys.41, .'175 (1969). See also E. R. Cohen and B. N. Taylor, Journal 
of Physical and Chemical Reference Data 2(4), 663 (1973). 
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